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Field of the Invention 
The invention relates generally to /3(l,4)-galactosyltransferase I mutants 
having altered donor and acceptor specificities, and methods of use thereof. In 
addition, the invention relates to methods for using the j3(l,4)- 
20 galactosyltransferase I mutants to increase the immunogenicity of an antigen, 
such as vaccines, and for synthesizing saccharide compositions. 



Background of the Invention 
Oligosaccharides are chains composed of saccharide units, which are 
25 commonly known as sugars. Of the biological polymer families, 

oligosaccharides are the least studied, due in part to the difficulty of sequencing 
and synthesizing their complex sugar chain. Currently, no generally applicable 
synthetic technique for synthesizing oligosaccharides is available. 

Intensive research efforts have been devoted to carbohydrates and 
30 molecules comprising carbohydrate fragments, such as glycolipids and 

glycoproteins. Research interest in these moieties has been largely due to the 
recognition that interaction between proteins and carbohydrates are involved in a 
wide array of biological recognition events, including fertilization, molecular 
targeting, intracellular recognition, and viral, bacterial, and fungal pathogenesis. 
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It is now widely appreciated that the oligosaccharide portions of glycoproteins 
and glycolipids mediate cell-cell interactions, cell-ligand interactions, cell- 
extracellular matrix interactions, and cell-pathogen interactions. 

It is thought that many of these interactions can be inhibited by 

5 oligosaccharides that have the same sugar sequence and stereochemistry found 
on the active portion of a glycoprotein or glycolipid involved in the interactions. 
The oligosaccharides are believed to compete with the glycoproteins and 
glycolipids for binding sites on the receptor proteins. For example, the 
disaccharide galactosyl-|8(l,4)-N-acetylglucosainine is believed to be one 

1 0 component of the glycoprotein which interacts with receptors in the plasma 
membrane of liver cells. Thus, oligosaccharides and other saccharide 
compositions that mimic ligands recognized and bound by cellular receptors are 
thought to be useful in applications that include diagnostics and therapeutics. 
In addition to mediating numerous cellular interactions, many 

15 oligosaccharides are recognized by the immune system. For example, Anti-Gal, 
a naturally occurring antibody present in all humans, specifically interacts with 
the carbohydrate epitope Gal-o(l-3)Gal-i8(l-4)GlcNAc-R (a-galactosyl epitope). 
This antibody does not interact with any other known carbohydrate epitope 
produced by mammalian cells (Galili, Springer Seminar Immunopathology, 

20 15:153 (1993)). Anti-Gal constitutes approximately 1% of circulating IgG (Galili 
et aL, J. Exp. Med., 160:1519 (1984)) and is also found in the form of IgA and 
IgM (Davine et aL, Kidney Int., 31:1132 (1987); Sandrin et aL, Proc. NatL Acad. 
ScL, 90:11391 (1993)). It is produced by 1% of circulating B-lymphocytes 
(Galili et aL, Blood, 82:2485 (1993)). Accordingly, the ability of carbohydrates 

25 to elicit an immune response can be utilized to increase the effectiveness of 
vaccines against many types of pathogens by linking such a carbohydrate to a 
vacccine to increase the immune response to the vaccine. 

There has been relatively little effort to test oligosaccharides as 
therapeutic agents for humans or animal diseases however, as methods to 

30 synthesize oligosaccharides have been unavailable as noted above. Limited 

types of small oligosaccharides can be custom-synthesized by organic chemical 
methods, but the cost of such compounds is typically prohibitively high. In 
addition, it is very difficult to synthesize oligosaccharides stereospecifically and 
the addition of some sugars, such as sialic acid and fucose, has not been 
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effectively accomplished because of the extreme lability of their bonds. 
Improved, generally applicable methods for oligosaccharide synthesis are 
thereby desired for the production of large amounts of widely varying 
oligosaccharides for therapeutic purposes. Accordingly, the present invention 
5 provides enzymes and methods that can be used to promote the chemical linkage 
of numerous sugars that have previously been difficult to link. 

Summary of the Invention 
The invention provides altered /?(l,4)-galactosyltransferase I catalytic 
1 0 domains that catalyze the formation of a glucose-j3(l ,4)-N-acetylglucosamine 
bond at a greater rate than the catalytic domain of the corresponding wild-type 
enzyme. The invention also provides j3(l,4)-galactosyltransferase I catalytic 
domains that catalyze formation of N-acetylgalactosamine-j8(l,4)-N- 
acetylglucosamine bonds; N-acetylgalactosamine-0(l,4)-glucose bonds; N- 
1 5 acetylglucosamine-j3(l ,4)-N-acetylglucosamine bonds; mannose-j8(l ,4)-N- 
acetylglucosamine bonds; and galactose-/3(l,4)-N-acetylglucosamine-6-S03 
bonds. The invention also provides polypeptides that contain each of the 
aforementioned catalytic domains. 

Also provided by the invention are amino acid segments that promote in 
20 vitro folding of catalytic domains of galactosyltransferases, such as ]3(1,4)- 
galactosyltransferase I, and mutants of galactosyltransferases. 

The invention provides nucleic acid segments that encode the 
aforementioned j3(l,4)-galactosyltransferase I catalytic domains. Expression 
cassettes and cells that include nucleic acid segments that encode the 
25 aforementioned j3(l,4)-galactosyltransferase I catalytic domains are also 
provided. 

Additionally provided are methods to synthesize a glucose-|8(l,4)-N- 
acetylglucosamine moiety; anN"acetylgalactosamine-j3(l,4)--N- 
acetylglucosamine moiety; an N-acetylgalactosamine-j8(l,4)-glucose moiety; an 
30 N-acetylglucosamin&-j8(l 5 4)-N--acetylglucosamine moiety; a mannose-/?(l,4)-N- 
acetylglucosamine moiety; and a galactose-j8(l,4)-N-acetylglucosamine-6-S03 
moiety. 



3 
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The invention also provides methods to increase the iinmunogenicity of 
an antigen, and methods to prepare an oligosaccharide composition, including 
those having a defined sequence. 

Further provided hy the invention are oligosaccharides produced through 
5 use of the catalytic domains and methods disclosed herein. 

Brief Description of the Figures 
Fig. 1 illustrates the catalytic activity of the wt-Gal-Tl and the mutants 
Y289L, Y289I, and Y289N. (A) Gal-T activity of wt-Gal-Tl and of mutants at 

10 the saturating concentrations of yS-benzyl-GlcNAc as the acceptor. (B) GalNAc- 
T activity of the wt-Gal-Tl and of mutants. The activities are measured with 50 
mM y^benzyl-GlcNAc as acceptors at different concentrations of the donor. The 
Gal-T activity of the wt-Gal-Tl was measured with only 10 mM ^benzyl- 
GlcNAc because it exhibited inhibition at 50 mM concentration. 

1 5 FIG. 2 illustrates the partial *H NMR spectrum of the disaccharide 

(LacdiNAc) product from the GalNAc-T reaction with GlcNAc as the acceptor. 
The signal for the GalNAc anomeric proton is at 5 4.58 ppm and for the GlcNAc 
anomeric proton corresponding to an a and a ft conformer is at 85.2 and 4.7 
ppm. The signals from the acetyl group of each sugar moiety are at <52.05 and 

20 2.08 ppm. 

Fig. 3 illustrates the cDNA sequence of the inserts and the derived 
protein sequences of the constructs in the pET23a vector used for studying the 
effect of the TV-terminal stem region (SR) on the solubility and folding of the 
catalytic domain (CD) of (34Gal-T. (A) Sequence of bovine SRCD p4Gal-Tl 

25 having Ser 96 in the wild-type of p4Gal-Tl mutated to Ala 96. (B) Sequence of 
human SRCDP4Gal -Tl. The numbers above the amino acids correspond to the 
residues that delineate the stem region. An 1 1 amino acid extension (not shown) 
is located at the amino-terminal end of the protein and is coded by the pET23a 
vector leader sequence. The stem region sequence is shaded, followed by the 

30 catalytic domain (CD) sequence. The primers used for PCR amplification are 
underlined. 

Fig. 4 shows an SDS-PAGE analysis of the recombinant CD and SRCD 
proteins of p4Gal-T produced in E. coli. Purified inclusion bodies were 
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analyzed on 14% SDS-PAGE. Lane 1, bovine CDp4Gal-Tl; Lane 2, bovine 
SRCD |34Gal-Tl; Lane 3, human CDp4Gal-Tl; and Lane 4, human 
SRCDp4Gal-Tl. 

Fig. 5 shows folding of inclusion bodies under different renaturation 
5 conditions: the relative activities of CDp4Gal-Tl and SRCDP4Gal-Tl. The 
relative activity was calculated with respect to the folding condition I. The 
largest amount of folded CDp4Gal-Tl and SRCD[34Gal-Tl was obtained under 
condition VIII. The enzyme activity under condition I for CDP4Gal-Tl was 
0.052 pmol/min/ng, and for SRCDp4Gal-Tl 0.09 pmol/min/ng of N- 
10 acetyllactosamine. 

Fig. 6 shows analysis of the soluble proteins obtained after renaturation 
of inclusion bodies by SDS-PAGE electrophoresis. Under native conditions (-), 
(without boiling and in the absence of p-ME) and under denaturing conditions 
(+) (samples boiled in the presence of p-ME). Lane 1 : bovine CDp4Gal-Tl ; 
15 Lane 2: bovine SRCDP4Gal-Tl; Lane 3: human CDp4Gal-Tl; and Lane 4: 

human SRCDp4Gal-Tl . Under native conditions a portion of the soluble SRCD 
sample representing misfolded proteins does not enter the gel (arrow 1 in lanes 2 
and 4). 

Fig. 7 illustrates binding and characterization of folded CDP4Gal-Tl and 
20 SRCDP4Gal-Tl on UDP-agarose columns. The purity of the proteins was 
judged using SDS-PAGE analysis under native conditions in the absence of 
pME and without boiling the samples. The protein sample before loading on the 
UDP-agarose column (U) and after eluting with 25 mM EDTA and 1M NaCl 
(B). Panel (A) shows the results with the human CDp4Gal-Tl before and after 
25 passing through the UDP-agarose column, (bovine CDp4Gal-Tl [not shown] 
behaves in the same way). Panel (B) shows the results with bovine 
SRCDp4Gal-Tl, and Panel (C) shows the results with human SRCDp4Gal-Tl. 
The arrows at the top of the lanes in panels B and C show the presence of soluble 
but misfolded proteins. 
30 Fig. 8 shows the enzymatic activity of the soluble human and bovine 

SRCDp4Gal-Tl proteins before and after UDP-agarose purification. The 
galactosyltransferase activities were measured as described herein. Specific 
activities of these proteins before binding ((-), black bars) and of the eluates after 
passing through the UDP-agarose column ((+), cross-hatched bars) are shown. 

S 
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(A) Buffer condition I (without PEG-4000 and L-arginine), showed more 
misfolded molecules in the human SRCD (34Gal-Tl compared to bovine 
SRCDp4Gal-Tl. (B) Condition VUI (with PEG-4000 and L-arginine) showed 
more properly folded proteins in the human SRCD(34Gal-Tl compared to bovine 

5 SRCDp4Gal-Tl (black bars). CD34Gal-Tl showed difference in the amounts of 
folded proteins before and after binding on the UDP-agarose columns either with 
folding condition I or condition VIH (data not shown). 

Fig. 9 is a schematic diagram of the in vitro folding of CD and SRCD 
domains of p4Gal-Tl . The unfolded CD and SRCD domains during the in vitro 

10 folding step are thought to generate folding intermediates (I) and (II), 

respectively, which produce misfolded and properly folded molecules. The 
misfolded molecules of CD, (III) are mostly insoluble and precipitate out, 
whereas properly folded molecules (IV) are soluble, remain in solution and bind 
to a UDP-agarose column. SRCD misfolded (HI') and properly folded 

15 molecules (IV') are mostly soluble. Due to the solubility effect of SR domain 
the proportion of the properly folded SRCD molecules increased during the 
folding process. The misfolded SRCD molecules, although soluble, did not 
significantly bind to UDP-agarose columns (IIT). Properly folded SRCD 
molecules bind (IV') and elute from the UDP-agarose column. The presence of 

20 PEG-4000 and L-arginine in the folding solution increased the proportion of the 
properly folded molecules. 

Detailed Description of the Invention 
j3(l,4)-galactosyltransferase I catalyzes the transfer of galactose from the 

25 donor, UDP-galactose, to an acceptor, N-acetylglucosamine, to form a galactose- 
j3(l,4)-N-acetylglucosamine bond. This reaction allows galactose to be linked to 
an N-acetylglucosamine that may itself be linked to a variety of other molecules. 
Examples of these molecules include other sugars and proteins. The reaction can 
be used to make many types of molecules having great biological significance. 

30 For example, galactose-j8(l,4)-N-acetylglucosamine linkages are important for 
many recognition events that control how cells interact with each other in the 
body, and how cells interact with pathogens. In addition, numerous other 
linkages of this type are also very important for cellular recognition and binding 
events as well as cellular interactions with pathogens, such as viruses. 
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Therefore, methods to synthesize these types of bonds have many applications in 
research and medicine to develop pharmaceutical agents and improved vaccines 
that can be used to treat disease. 

The present invention is based on the surprising discovery that the 

5 enzymatic activity of j8(l ,4)-galactosyltransferase can be altered such that the 
enzyme can make chemical bonds that are very difficult to make by other 
methods. These alterations involve mutating the enzyme such that the mutated 
enzyme can transfer many different types of donors, sugars for example, to many 
different types of acceptors. Therefore, the mutated |8(1,4)- 

10 galactosyltransferases of the invention can be used to synthesize a variety of 
products that, until now, have been very difficult and expensive to produce. 

The invention also provides amino acid segments that promote the proper 
folding of a galactosyltransferase catalytic domain. The amino acid segments 
maybe used to properly fold the galactosyltransferase catalytic domains of the 

15 invention and thereby increase their activity. The amino acid segments may also 
be used to increase the activity of galactosyltransferases that are produced 
recombinantly. Accordingly, use of the amino acid segments according to the 
invention allows for production of j8(l,4)-galactosyltransferases having increased 
enzymatic activity relative to j3(l,4)-galactosyltransferases produced in the 

20 absence of the amino acid segments. 

Definitions: 

Abbreviations: stem region/catalytic domain |3( l,4)-Galactosyltransferase 
I (SRCD|34Gal-Tl); catalytic domain of |S(l,4)-Galactosyltransferase I 
25 (CDp4Gal-Tl); j3(l,4)-Galactosyltransferase I ((34Gal-Tl); catalytic domain 
(CD); stem region (SR); wild-type (wt); galactosyltransferase activity (Gal-T); 
beta-mercaptoethanol ((3-ME); N-acetylgalactosamine transferase activity 
(GalNAc-T ); a-Lactalbumin (LA). 

The term "acceptor" refers to a molecule or structure onto which a donor 
30 is actively linked through action of a catalytic domain of a galactosyltransferase, 
or mutant thereof. Examples of acceptors include, but are not limited to, 
carbohydrates, glycoproteins, and glycolipids. 

The term "catalytic domain" refers to an amino acid segment which folds 
into a domain that is able to catalyze the linkage of a donor to an acceptor. For 

1 
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example, a catalytic domain may be from, but is not limited to, bovine j3(l,4)- 
Galactosyltransferase I (Seq ID NO: 5), the catalytic domain from human 0(1,4)- 
Galactosyltransferase I (Seq ID NO: 3), or the catalytic domain from mouse 
i8(l,4)-Galactosyltransferase I (Seq ID NO: 4) . A catalytic domain may have an 

5 amino acid sequence found in a wild-type enzyme, or may have an amino acid 
sequence that is different from a wild-type sequence. For example, a catalytic 
domain may have an amino acid sequence that corresponds to amino acid 
residues 130-402 of SEQ ID NO: 5, expect that the lysine is exchanged with 
arginine at amino acid position 228. 

10 The term "donor" refers to a molecule that is actively linked to an 

acceptor molecule through the action of a catalytic domain of a 
galactosyltransferase, or mutant thereof A donor molecule can include a sugar, 
or a sugar derivative. Examples of donors include, but are not limited to, UDP- 
galactose, UDP-mannose, UDP-N-acetylglucosamine, UDP-glucose, GDP- 

15 mannose, UDP-N-acetylgalactosamine, UDP-glucuronic acid, GDP-Fucose, and 
CMP-N-acetylneuraminic acid. Donors include sugar derivatives that include 
active groups, such as cross-linking agents or labeling agents. Accordingly, 
oligosaccharides may be prepared according to the methods of the invention that 
include a sugar derivative having a desired characteristic. 

20 "Expression cassette" as used herein means a DNA sequence capable of 

directing expression of a particular nucleotide sequence in an appropriate host 
cell, comprising a promoter operably linked to the nucleotide sequence of 
interest that is operably linked to termination signals. It also typically comprises 
sequences required for proper translation of the nucleotide sequence. The 

25 expression cassette may be one that is naturally occurring but has been obtained 
in a recombinant form useful for heterologous expression. The expression of the 
nucleotide sequence in the expression cassette may be under the control of a 
constitutive promoter or of an inducible promoter that initiates transcription only 
when the host cell is exposed to some particular external stimulus. In the case of 

30 a multicellular organism, the promoter can also be specific to a particular tissue 
or organ or stage of development. 

The terms oligosaccharide and polysaccharide are used interchangeably 
herein. These terms refer to saccharide chains having two or more linked sugars. 
Oligosaccharides and polysaccharides may be homopolymers and 
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heteropolymers having a random sugar sequence or a preselected sugar 
sequence. Additionally, oligosaccharides and polysaccharides may contain 
sugars that are normally found in nature, derivatives of sugars, and mixed 
polymers thereof. 

5 "Polypeptides" and "Proteins" are used interchangeably herein. 

Polypeptides and proteins can be expressed in vivo through use of prokaryotic or 
eukaryotic expression systems. Many such expressions systems are known in 
the art and are commercially available. (Clontech, Palo Alto, CA; Stratagene, La 
Jolla, CA). Examples of such systems include, but are not limited to, the T7- 
10 expression system in prokaryotes and the bacculovirus expression system in 
eukaryotes. Polypeptides can also be synthesized in vitro, e.g., by the solid 
phase peptide synthetic method or by in vitro transcription/translation systems. 
Such methods are described, for example, in U.S. Patent Nos. 5,595,887; 
5,116,750; 5,168,049 and 5,053,133; Olson et al., Peptides, 9, 301, 307 (1988). 
1 5 The solid phase peptide synthetic method is an established and widely used 
method, which is described in the following references: Stewart et al., Solid 
Phase Peptide Synthesis, W. H. Freeman Co., San Francisco (1969); Merrifield, 
J. Am. Chem. Soc, 85 2149 (1963); Meienhofer in "Hormonal Proteins and 
Peptides," ed.; C.H. Li, Vol. 2 (Academic Press, 1973), pp. 48-267; Bavaay and 
20 Merrifield, "The Peptides," eds. E. Gross and F. Meienhofer, Vol. 2 (Academic 
Press, 1980) pp. 3-285; and Clark-Lewis et al., Meth. EnzvmoL 287, 233 
(1 997). These polypeptides can be further purified by fractionation on 
immunoaffinity or ion-exchange columns; ethanol precipitation; reverse phase 
HPLC; chromatography on silica or on an anion-exchange resin such as DEAE; 
25 chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration 
using, for example, Sephadex G-75; or ligand affinity chromatography. 

The polypeptides of the invention include polypeptides having amino 
acid exchanges, i.e., variant polypeptides, so long as the polypeptide variant is 
biologically active. The variant polypeptides include the exchange of at least 
30 one amino acid residue in the polypeptide for another amino acid residue, 

including exchanges that utilize the D rather than L form, as well as other well 
known amino acid analogs, e.g., N-alkyl amino acids, lactic acid, and the like. 
These analogs include phosphoserine, phosphothreonine, phosphotyrosine, 
hydroxyproline, gamma-carboxyglutamate; hippuric acid, octahydroindole-2- 
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carboxylic acid, statine, l,2,3,4,-tetrahydroisoquinoline-3-carboxylic acid, 
penicillamine, ornithine, citruline, N-methyl-alanine, para-benzoyl- 
phenylalanine, phenylglycine, propargylglycine, sarcosine, N-acetylserine, N- 
formylmethionine, 3-methylhistidine, 5-hydroxylysine, and other similar amino 
5 acids and imino acids and tert-butylglycine. 

Conservative amino acid exchanges are preferred and include, for 
example; aspartic-glutamic as acidic amino acids; lysine/arginine/histidine as 
basic amino acids; leucine/isoleucine, methionine/valine, alanine/valine as 
hydrophobic amino acids; serine/glycine/alanine/threonine as hydrophilic amino 

10 acids. Conservative amino acid exchange also includes groupings based on side 
chains. Members in each group can be exchanged with another. For example, a 
group of amino acids having aliphatic side chains is glycine, alanine, valine, 
leucine, and isoleucine. These may be exchanged with one another. A group of 
amino acids having aliphatic-hydroxyl side chains is serine and threonine. A 

1 5 group of amino acids having amide-containing side chains is asparagine and 
glutamine. A group of amino acids having aromatic side chains is 
phenylalanine, tyrosine, and tryptophan. A group of amino acids having basic 
side chains is lysine, arginine, and histidine. A group of amino acids having 
sulfur-containing side chains is cysteine and methionine. For example, 

20 replacement of a leucine with an isoleucine or valine, an aspartate with a 

glutamate, a threonine with a serine, or a similar replacement of an amino acid 
with a structurally related amino acid may be accomplished to produce a variant 
polypeptide of the invention. 

25 I- ff(L4VGala ctosvltransferase I catalytic domains of the invention. 

A. Catalytic domains that catalyze the formation of a bond between a 
donor and an acc eptor to form glucose-j8(L4)-N-acetvlp lucosamiTift htmA& 

It has been discovered that mutation of the donor binding site of 0(1,4)- 
galactosyltransferase I can broaden the donor specificity of the enzyme. More 

30 specifically, it has been determined that substitution of amino acid residues 
located in the donor binding site of j3(l,4)-galactosyltransferase I to provide 
greater flexibility and decreased steric hindrance allow glucose to be bound and 
chemically bonded to N-acetylglucosamine. Such mutations provide for 
broadened donor binding, such as binding of glucose, while still preserving 
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interaction with amino acid residues active during catalytic bond formation 
between the donor and the acceptor. Without being bound by any theory, an 
example of a catalytic residue thought to be important for catalysis is a glutamic 
acid positioned at amino acid position 317 (E3 17) in the bovine 0(1,4)- 
5 galactosyltransferase I. This glutamic acid in bovine 0(1,4)- 

galactosyltransferase I corresponds to a glutamic acid residue at amino acid 
position 313 and at amino acid position 314 in the human and mouse 0(1,4)- 
galactosyltransferase I respectively. Accordingly, the invention provides 0(1,4)- 
galactosyltransferase I mutants having amino acid substitutions, insertions, and 
1 0 deletions that provide greater flexibility and decreased steric hindrance in the 
donor binding site to allow the mutated 0(l,4)-galactosyltransferase I to catalyze 
chemical bonding of the donor to an acceptor, such as N-acetylglucosamine or 
glucose. 

In some embodiments, the catalytic domains of the invention have an 

15 arginine exchanged with another amino acid at an amino acid position 

corresponding to 228 in the bovine 0(l,4)-galactosyltransferase I (SEQ ID NO: 
5). An example of a specific exchange is R228K. The corresponding arginine in 
the human and mouse 0(l,4)-galactosyltransferase I is located at amino acid 
position 224 and 225 (SEQ ID Nos: 3 and 4 respectively). In mouse, human, 

20 and bovine 0(1 ,4)-galactosyltransferase I, the arginine is located within the 

amino acid sequence FNRAKLL (SEQ ID NO: 1), Accordingly, those of skill in 
the art can readily determine an equivalent amino acid in other 0(1,4)- 
galactosyltransferase I catalytic domains. 

In other embodiments, the catalytic domains of the invention have an 

25 arginine exchanged with another amino acid at an amino acid position 

corresponding to 228, and an alanine exchanged with another amino acid at an 
amino acid position corresponding to 229 in the bovine 0(1, 4)- 
galactosyltransferase I. 

Such catalytic domains are exemplified by a catalytic domain of bovine 

30 0(l,4)-galactosyltransferase I having the arginine at amino acid position 228 
exchanged with lysine (R228K), and the alanine at amino acid position 229 
exchanged with glycine (A229G). The corresponding alanine in the human and 
mouse 0(l,4)-galactosyltransferase I is located at amino acid position 225 and 
226 (SEQ ID Nos: 3 and 4 respectively). In mouse, human, and bovine 0(1,4)- 
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galactosyltransferase I, the arginine is located within the amino acid sequence 
FNRAKLL (SEQ ID NO: 1). Accordingly, those of skill in the art can readily 
determine an equivalent amino acid in other j3(l,4)-galactosyltransferase I 
catalytic domains. 

5 

B. Catalytic domains that catalyze the formation of a bond b etween a 
donor and an acceptor to form N-acetvlgalac tosamine-flQ.4VN- 
acetvlglucosamine bonds. 

It was postulated that formation of a hydrogen bond between N- 

1 0 acetylgalactosamine and an amino acid residue adjoining the donor binding site 
in j3(l,4)-galactosyltransferase I is responsible for poor transfer of N- 
acetylgalactosamine to an acceptor. It was also postulated that mutation of one 
or more amino acid residues in the donor binding site in 0(1,4)- 
galactosyltransferase I to eliminate hydrogen bond formation with N- 

1 5 acetylgalactosamine allows the mutated /3(1 ,4)-galactosyltransferase I to transfer 
N-acetylgalactosamine from a donor to an acceptor more efficiently. Therefore, 
the invention includes mutants of j3(l,4)-galactosyltransferase I in which 
hydrogen bonds that reduce transfer of N-acetylgalactosamine to an acceptor, 
such as N-acetylglucosamine or glucose, are reduced or absent. 

20 In some embodiments, the catalytic domains of the invention have a 

tyrosine exchanged with another amino acid at an amino acid position 
corresponding to 289 in the bovine j3(l,4)-galactosyltransferase I (SEQ ID NO: 
5). Examples of specific exchanges are Y289L,Y289I, and Y289N. The 
corresponding tyrosine in the human and mouse ]8(l,4)-galactosyltransferase I is 

25 located at amino acid position 285 and 286 (SEQ ID Nos: 3 and 4 respectively). 
In mouse, human, and bovine /3(l,4)-galactosyltransferase I, the tyrosine is 
located within the amino acid sequence YVQYFGG (SEQ ID NO: 2). 
Accordingly, those of skill in the art can readily determine equivalent amino 
acids in other /3(l,4)-galactosyltransferase I catalytic domains, 

30 Mutants in which the tyrosine corresponding to that located at amino acid 

position 289 in the bovine j3(l,4)-galactosyltransferase I has been exchanged by 
another amino acid may optionally include a second mutation corresponding to 
amino acid position 342. Such a mutation may include exchange of cysteine at 
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amino acid position 342 with threonine (C342T). However, other amino acids 
may be exchanged for cysteine that provide and active catalytic domain. 

C. Catalytic domains that catalyze the formation of a bond between a 
5 donor and an accentor to form N-acetvlealactosamine-flf 1.4Vglucose bonds. 

0( 1 ,4)-galactosyltransferase I mutants, as described herein, that are able 
to catalyze chemical bond formation of N-acetylgalactosamine to an acceptor 
may be used in conjunction with o-lactalbumin to catalyze the formation of N- 
acetylgalactosamine-/3(l,4)-glucose bonds. 
10 a-Lactalbumin is a mammary gland-specific calcium-binding protein that 

alters the sugar acceptor specificity of j8(l,4)-galactosyltransferase I toward 
glucose. Consequently, a-lactalbumin may be used to alter the acceptor 
specificity /3(l,4)-gdactosyltransferase I, and mutants thereof that are described 
herein, to efficiently catalyze N-acetylgalactosamine-j8(l,4)-glucose bond 
15 formation. Conditions for use of a-lactalbumin in conjunction with a 
galactosyltransferase, or active domain thereof, have been described 
(Ramakrishnan et al., J. Biol. Chem.. 276:37665 (2001)). 

D - Catalytic do mains that catalyze the formation of a bond between a 
20 donor and a n acceptor to form N-acetvlg 1ucosamin eH8n.4VN- 
acetvlglucosamin e bonds, oligo N-acetvlga lactosamin e-iSf 1.41N- 
acetvlglucosamin e. and mannose-/3fl.4VN-acetvlglucosaminft 

The mutations described herein may be combined to create catalytic 
domains having selectively altered donor specificities. For example, a catalytic 
25 domain obtained from bovine jS(l ,4)-galactosyltransferase I having exchanges at 
amino acid positions 228 (R228K) and 289 (Y289L) was able to catalyze the 
linkage of N-acetylglucosamine to N-acetylglucosamine to form aN- 
acetylglucosamine-^(l,4)-N-acetylglucosamine bond. The same mutant 
catalytic domain was able to catalyze the linkage of N-acetylgalactosamine to N- 
30 acetylglucosamine to form oligo N-acetylgalactosamine-j8(l,4)-N- 

acetylglucosamine. The broadened donor specificity was further demonstrated 
by the ability of the mutant catalytic domain to catalyze the linkage of mannose 
to N-acetylglucosamine to form mannose-/3(l,4)-N-acetylglucosamine. 
Accordingly, numerous mutant catalytic domains having altered donor 

\3 
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specificity may be created by mutating amino acids corresponding to those at 
positions corresponding to 228 and 289 of the bovine 0(1,4)- 
galactosyltransferase L As described above, these amino acid positions may be 
readily determined in galactosyltransferase enzymes obtained from other 
5 organisms, such as humans, and mutated in produce additional catalytic domains 
having altered donor specificity. 

E. Catalytic domains that catalyze the formation of a bond b etween a 
donor and an acceptor having a bulky side-group to form for example, 

10 galactose-flf 1 »4VN-acetvlglucosamine-6-SCh bonds. 

The acceptor specificity of a catalytic domain obtained from a 
galactosyltransferase may be altered to create catalytic domains capable of 
transferring a donor onto an acceptor having a bulky and/or charged side-group. 
An example of such an altered catalytic domain obtained from bovine 

15 j8(l,4)-galactosyltransferase I has substitutions at amino acid positions 279 

(K279S) and 280 (F280T). This altered catalytic domain is able to catalyze the 
transfer of galactose to N-acetylglucosamine-6-S0 3 to form galactose-j3(l,4)-N- 
acetylglucosamine-6-S03. Additional catalytic domains may be created by 
altering one or more amino acid residues at positions corresponding to 279 and 

20 280 of the bovine /3(l,4)-galactosyltransferase L As described above, these 
amino acid positions may be readily determined in galactosyltransferase 
enzymes obtained from other organisms, such as humans, and mutated in 
produce additional catalytic domains having altered acceptor specificity. 

The amino acids at positions corresponding to 279 and 280 in the bovine 

25 0(l,4)-galactosyltransferase I may be exchanged individually or together to 
create many different catalytic domains having altered acceptor sites able to 
accept numerous acceptors having bulky (sterically large) or charged side- 
groups. Such altered catalytic domains may be used to catalyze linkage of 
sugars from a donor to an acceptor having a desired side-chain. 

30 

II. Catalytic domains of the invention may be included within full-length 
/3(l,4>galactosvltransferase I enzymes, or in recombinant molecules containing 
the catalytic domains. 
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Peptides of the invention include isolated catalytic domains, full-length 
/3(l,4)-galactosyltransferase I enzymes containing a catalytic domain of the 
invention, as well as recombinant polypeptides comprising a catalytic domain of 
the invention that are linked to additional amino acids. Such polypeptides may 
5 be expressed from DNA constructs and expression cassettes that are produced 
through use of recombinant methods. Such methods have been described. 
Sambrook et aL, Molecular Cloning: A Laboratory Manual, 3rd edition, Cold 
Spring Harbor Press, Cold Spring Harbor, N.Y. (2001). 

Galactosyltransferase enzymes containing a catalytic domain of the 
1 0 invention may be produced in soluble form. Methods that may be used to 

produce such soluble enzymes have been described (U.S. Patent No. 5,032,519). 
Briefly, a hydrophobic transmembrane anchor region of a galactosyltransferase 
is removed to produce an enzyme that is in soluble form. 

Alternatively, /3(l,4)-galactosyltransferase enzymes containing a catalytic 
1 5 domain of the invention may be produced such that they are anchored in the 

membrane of a cell that expresses the galactosyltransferase. Such enzymes may 
be produced that are anchored in the membranes of prokaryotic and eukaryotic 
cells. Methods to produce such enzymes have been described (U.S. Patent No: 
6,284,493). 

20 Briefly, in the case of procaryotes, the signal and transmembrane 

sequences of the galactosyltransferase are replaced by a bacterial signal 
sequence, capable of effecting localization of the fusion protein to the outer 
membrane. Suitable signal sequences include, but are not limited to those from 
the major E.coli lipoprotein Lpp and lam B. In addition, membrane spanning 

25 regions from Omp A, Omp C, Omp F or Pho E can be used in a tripartite fusion 
protein to direct proper insertion of the fusion protein into the outer membrane. 
Any procaryotic cells can be used in accordance with the present invention 
including but not limited to E. coli, Bacillus sp., and Pseudomonas sp. as 
representative examples. 

30 In another embodiment, the native transmembrane domain of the 

galactosyltransferase is replaced by the transmembrane domain of a bacterial 
outer membrane protein. In this embodiment, the galactosyltransferase signal 
sequence and the bacterial transmembrane region act in concert to anchor the 
galactosyltransferase to the bacterial outer cell membrane. Nearly any outer 
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membrane bound protein is suitable for this use including but not limited to Omp 
A, Omp C, and Omp F, Lpp, and Lam B. The catalytic portion of the 
galactosyltransferase should be fused to an extracellular loop in the bacterial 
transmembrane region in order to insure proper orientation of the fusion protein 
5 on the outer membrane surface and not in the cytoplasm or periplasm of the cell. 
Insertion of a protein into such a loop region has been previously reported 
(Charbit et al., J. Bacteriology, 173:262 (1991); Francisco et al., Proc. Natl 
Acad. ScL 89:2713(1992)). 

The present invention is also applicable for use with eukaryotic cells 

10 resulting in cell surface expression of galactosyltransferases in known culturable 
eucaryotic cells including but not limited to yeast cells, insect cells, Chinese 
hamster ovary cells (CHO cells), mouse L cells, mouse A9 cells, baby hamster 
kidney cells, C127 cells, COS cells, Sf9 cells, and PC8 cells. 

In another embodiment of the present invention, the transmembrane 

15 domain of the galactosyltransferase is replaced by the transmembrane domain of 
a plasma membrane protein. The transmembrane domain of any resident plasma 
membrane protein will be appropriate for this purpose. The transmembrane 
portions of the M6 P/IGF-II receptor, LDL receptor or the transferrin receptor 
are representative examples. 

20 In another embodiment the Golgi retention signal of the 

galactosyltransferase is disrupted by site-directed mutagenesis. This approach 
mutates the amino acids responsible for localizing the galactosyltransferase to 
the Golgi compartment. The resultant galactosyltransferase is transported to the 
plasma membrane where it becomes anchored via its modified transmembrane 

25 sequences. Substitution of isoleucine residues for the native amino acids in the 
transmembrane region of the /?(l,4)galactosyltransferase has been shown to 
preferentially localize the enzyme to the plasma membrane instead of the Golgi 
apparatus (Masibay et al., J. Biol. Chenu 268:9908 (1993)). 

30 III. A stem region that promotes the in vitro folding of a catalytic domain of 
a galactosyltransferase. 

/?(l,4)-galactosyltransferase I is a type II Golgi resident protein with a 
short cytoplasmic tail, a transmembrane domain followed by a stem region and 
has a globular catalytic domain that faces the Golgi lumen. When the catalytic 
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domain of j3(l,4)-galactosyltransferase I is expressed in E. coli, it forms 
insoluble inclusion bodies. These inclusion bodies can be collected and then 
solubilized and folded in vitro to produce catalytically active domains. Thus, the 
in vitro folding efficiency is directly related to the quantity of active enzyme that 
5 is produced from the isolated inclusion bodies. Accordingly, methods to 
increase the in vitro folding efficiency would provide increased production of 
catalytic domains that can be used to create useful products. 

The invention provides materials and methods that improve in vitro 
folding of catalytic domains from galactosyltransferases that are related to the 

10 use of a stem region (for example, SEQ ID NOs: 6 and 7) of 0(1,4)- 

galactosyltransferase I. It has been determined that fusion of a stem region from 
a 0(l,4)-galactosyltransferase I to the amino-terminus of the catalytic domain of 
a /?(l,4)-galactosyltransferase I produces increased in vitro folding efficiency of 
the catalytic domain. This increase in folding is thought to be universal among 

15 0(1 ,4)-galactosyltransferase I enzymes and was demonstrated with both the 
bovine and human enzymes. 

It has been further discovered that inclusion of PEG-4000 and L-Arg in 
the folding reaction results in a four-fold to seven- fold increase in catalytic 
domains that are natively folded when compared to refolding of the catalytic 

20 domain alone in the absence of PEG-4000 and L-Arg. PEG-4000 and L-arginine 
are thought to beneficially affect the solubility of folding intermediates of both 
catalytic domain-proteins (CD-proteins) and stem region/catalytic domain 
proteins (SRCD-proteins) during in vitro folding or protein obtained from 
inclusion bodies. The presence of PEG-4000 and L-arginine during in vitro 

25 folding enhanced the formation of both native and misfolded molecules. The 
processes involved are schematically shown in Figure 9. In the case of CD- 
proteins, the majority of misfolded proteins are insoluble in the absence of PEG- 
4000 and L-arginine and thus, they precipitate out during dialysis. This process 
left behind the properly folded molecules in solution that bound to UDP-agarose 

30 and were enzymatically active. (Figure 8) (Table IX). It is thought that the SR- 
domain, like PEG-4000 and L-arginine, helped to solubilize the folding 
intermediates, and hence enhanced the formation of both native and misfolded- 
SRCD molecules. The presence of PEG-4000 and L-arginine enhanced the 
solubilization of the folding intermediates of SRCD-molecules even further. The 



WO 2004/063344 



PCTYUS2004/000470 



misfolded SRCD proteins, in contrast to the majority of CD-proteins, remained 
soluble even in the absence of PEG-4000 and L-arginine. Therefore, the 
misfolded SRCD-proteins were not removed as precipitates during dialysis. 
Misfolded SRCD-proteins can be separated from properly folded proteins 
5 through binding on UDP-agarose columns (Table DC). Thus the SR-domain is 
thought to act as a solubilizing agent both for the misfolded and folded catalytic 
domain. It is thought that the increased solubility of SRCD-proteins is produced 
by preventing aggregation of misfolded proteins. In this respect its mode of 
action is thought to resemble the action of chaperone proteins. The positive 

10 effect of the iV-terminal stem region in the folding and stability of the native 
protein is very usefid for producing large quantities of other 
galactosyltransferase family members. 

The in vitro folding efficiency of bovine ]3(l,4)-galactosyltransferase I 
was further increased by substituting the cysteine at amino acid position 342 

1 5 with a threonine (C342T). Analogous mutations can be made in /3(1 ,4)- 
galactosyltransferase I enzymes from other organisms. 

It was determined that the wild-type bovine SRCDp4Gal-Tl, folded and 
purified from inclusion bodies, was cleaved at Ser96 within the stem region over 
a short period of time. Therefore, to decrease degradation of bovine 

20 SRCDp4Gal-Tl, the serine at amino acid position 96 was exchanged with an Ala 
to produce S96A-SRCDp4Gal-Tl . After folding and purification from bacterial 
inclusion bodies, S96A-SRCDp4Gal-Tl was found to be more stable over a long 
period of time when compared to SRCDp4Gal-Tl, which did not include the 
S96A mutation. S96A-SRCDp4Gal-Tl was used within some of the in vitro 

25 folding studies disclosed herein. 

The in vitro folding efficiency of catalytic domains that include a stem 
region in the presence of PEG-4000 and L-Arg was about 50 percent and the 
solubility of the refolded product was about 90 percent. 

Accordingly, the invention includes stem regions from members of the 

30 galactosyltransferase family that can be fused to a catalytic domain of a 
galactosyltransferase to provide increased in vitro folding of the catalytic 
domain. Such stem regions can be readily determined based on amino acid 
sequence homology to the bovine stem region and tested for the ability to 
promote folding of a galactosyltransferase catalytic domain. The invention also 
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includes the mutants disclosed herein and their corresponding analogs in other 
species. 

General methods for isolating and folding inclusion bodies containing 
galactosyltransferase catalytic domains have been previously described 
5 (Ramakrishnan et al., J. Biol. Chem.. 276:37665 (2001)). These methods may 
be used in conjunction with the stem region of the invention, PEG-4000, and L- 
Arg to increase the folding efficiency of a galactosyltransferase catalytic domain. 
These methods are described in the examples section herein. 

10 IV. Nucleic acid segments encoding catalytic domains of 6(1 AY 

galactosyltransferase I expression cassettes that include the nucleic acid 
segments, and cells that include the nucleic acid segments and expression 
cassettes. 

The present invention provides isolated nucleic acid segments that 

1 5 encode catalytic domains of |3(l,4)-galactosyltransferase I having altered donor 
or acceptor specificity. The present invention also provides nucleic acid 
segments that encode amino acid segments that promote proper folding of 
catalytic domains from galactosyltransferases, such as ]3(1,4)- 
galactosyltransferase I. 

20 Nucleic acid sequences encoding human /3(l,4)-galactosyltransferase I 

(SEQ ID NO: 8), as well as other /?(l,4)-galactosyltransferases I from other 
organisms are available. These nucleic acid sequences can be modified to 
encode the catalytic domains and amino acid segments of the invention through 
use of well-known techniques (Sambrook et al, Molecular Cloning: A 

25 Laboratory Manual, 3rd edition, Cold Spring Harbor Press, Cold Spring Harbor, 
N.Y. (2001)). For example, a portion of the nucleic acid sequence encoding 
human /3(l,4)-galactosyltransferase I (SEQ ID NO: 8) can be inserted into an 
expression vector such that an amino acid segment corresponding to the catalytic 
domain of human /3(l,4)-galactosyltransferase I (SEQ ID NO: 6) is expressed 

30 upon transformation of a cell with the expression vector. In another example, 
bovine /?(l,4)-galactosyltransferase I can be altered to replace the tyrosine at 
amino acid position 289 with leucine, isoleucine, or asparagine through use of 
site-directed mutagenesis (Ramakrishnan et al., J. Biol. Chem.. 277:20833 
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(2002)). Similar methods may be used to produce nucleic acid segments 
encoding additional mutants and catalytic domains described herein. 

The nucleic acid segments of the invention may be optimized for 
expression in select cells. Codon optimization tables are available. Harlow and 
5 Lane, Antibodies: A Laboratory Manual. Cold Spring Harbor Laboratory Press, 
1988. 

The nucleic acid segments can be inserted into numerous types of 
vectors. A vector may include, but is not limited to, any plasmid, phagemid, F- 
factor, virus, cosmid, or phage in double or single stranded linear or circular 

10 form which may or may not be self transmissible or mobilizable. The vector can 
also transform a prokaryotic or eukaryotic host either by integration into the 
cellular genome or exist extrachromosomally (e.g. autonomous replicating 
plasmid with an origin of replication). 

Preferably the nucleic acid segment in the vector is under the control of, 

15 and operably linked to, an appropriate promoter or other regulatory elements for 
transcription in vitro or in a host cell such as a eukaryotic cell or microbe, e.g. 
bacteria. The vector may be a bi-functional expression vector which functions in 
multiple hosts. In the case of genomic DNA, this may contain its own promoter 
or other regulatory elements and in the case of cDNA this may be under the 

20 control of a promoter or other regulatory sequences for expression in a host cell. 
Specifically included are shuttle vectors by which is meant a DNA 
vehicle capable, naturally or by design, of replication in two different host 
organisms, which may be selected from bacteria and eukaryotic cells (e.g. 
mammalian, yeast or fungal). 

25 The vector may also be a cloning vector which typically contains one or a 

small number of restriction endonuclease recognition sites at which nucleic acid 
segments can be inserted in a determinable fashion. Such insertion can occur 
without loss of essential biological function of the cloning vector. A cloning 
vector may also contain a marker gene that is suitable for use in the 

30 identification and selection of cells transformed with the cloning vector. 

Examples of marker genes are tetracycline resistance, hygromycin resistance or 
ampicillin resistance. Many cloning vectors are commercially available 
(Stratagene, New England Biolabs, Clonetech). 

^6 
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The nucleic acid segments of the invention may also be inserted into an 
expression vector. Typically an expression vector contains (1) prokaryotic DNA 
elements coding for abacterial replication origin and an antibiotic resistance 
gene to provide for the amplification and selection of the expression vector in a 

5 bacterial host; (2) regulatory elements that control initiation of transcription such 
as a promoter; and (3) DNA elements that control the processing of transcripts 
such as introns, transcription termination/polyadenylation sequence. 

Methods to introduce a nucleic acid segment into a vector are well known 
in the art (Sambrook et al., 1989). Briefly, a vector into which the nucleic acid 

1 0 segment is to be inserted is treated with one or more restriction enzymes 

(restriction endonuclease) to produce a linearized vector having a blunt end, a 
"sticky" end with a 5' or a 3 f overhang, or any combination of the above. The 
vector may also be treated with a restriction enzyme and subsequently treated with 
another modifying enzyme, such as a polymerase, an exonuclease, a phosphatase 

15 or a kinase, to create a linearized vector that has characteristics useful for ligation 
of a nucleic acid segment into the vector. The nucleic acid segment that is to be 
inserted into the vector is treated with one or more restriction enzymes to create a 
linearized segment having a blunt end, a "sticky" end with a 5 f or a 3 f overhang, or 
any combination of the above. The nucleic acid segment may also be treated with 

20 a restriction enzyme and subsequently treated with another DNA modifying 
enzyme. Such DNA modifying enzymes include, but are not limited to, 
polymerase, exonuclease, phosphatase or a kinase, to create a polynucleic acid 
segment that has characteristics useful for ligation of a nucleic acid segment into 
the vector. 

25 The treated vector and nucleic acid segment are then ligated together to 

form a construct containing a nucleic acid segment according to methods known 
in the art (Sambrook, 2002). Briefly, the treated nucleic acid fragment and the 
treated vector are combined in the presence of a suitable buffer and ligase. The 
mixture is then incubated under appropriate conditions to allow the ligase to ligate 

30 the nucleic acid fragment into the vector. It is preferred that the nucleic acid 
fragment and the vector each have complimentary "sticky" ends to increase 
ligation efficiency, as opposed to blunt-end ligation. It is more preferred that the 
vector and nucleic acid fragment are each treated with two different restriction 
enzymes to produce two different complimentary "sticky" ends. This allows for 

£\ 
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directional ligation of the nucleic acid fragment into the vector, increases ligation 
efficiency and avoids ligation of the ends of the vector to reform the vector 
without the inserted nucleic acid fragment. 

Suitable procaryotic vectors include but are not limited to pBR322, 
5 pMB9, pUC, lambda bacteriophage, ml3 bacteriophage, and Bluescript®. 
Suitable eukaryotic vectors include but are not limited to PMSG, pAV009/A+, 
PMTO10/A+, pMAM neo-5, bacculovirus, pDSVE, YIP5, YRP17, YEP. It will 
be clear to one of ordinary skill in the art which vector or promoter system 
should be used depending on which cell type is used for a host cell. 
10 The invention also provides expression cassettes which contain a control 

sequence capable of directing expression of a particular nucleic acid segment of 
the invention either in vitro or in a host cell. The expression cassette is an 
isolatable unit such that the expression cassette may be in linear form and 
functional in in vitro transcription and translation assays. The materials and 

15 procedures to conduct these assays are commercially available from Promega 
Corp. (Madison, Wisconsin). For example, an in vitro transcript maybe 
produced by placing a nucleic acid segment under the control of a T7 promoter 
and then using T7 RNA polymerase to produce an in vitro transcript. This 
transcript may then be translated in vitro through use of a rabbit reticulocyte 

20 lysate. Alternatively, the expression cassette can be incorporated into a vector 
allowing for replication and amplification of the expression cassette within a 
host cell or also in vitro transcription and translation of a nucleic acid segment. 

Such an expression cassette may contain one or a plurality of restriction 
sites allowing for placement of the nucleic acid segment under the regulation of 

25 a regulatory sequence. The expression cassette can also contain a termination 
signal operably linked to the nucleic acid segment as well as regulatory 
sequences required for proper translation of the nucleic acid segment. 
Expression of the nucleic acid segment in the expression cassette maybe under 
the control of a constitutive promoter or an inducible promoter which initiates 

30 transcription only when the host cell is exposed to some particular external 
stimulus. i 

The expression cassette may include in the 5'-3' direction of transcription, 
a transcriptional and translational initiation region, a nucleic acid segment and a 
transcriptional and translational termination region functional in vivo and /or in 
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vitro. The termination region may be native with the transcriptional initiation 
region, may be native with the nucleic acid segment, or may be derived from 
another source. Numerous termination regions are known in the art. Guerineau 
et al, Mol. Gen. Genet.. 262:141 (1991); Proudfoot, Cell, 64:671 (1991); 
5 Sanfacon et al., Genes Dev., 5:141 (1991); Munroe et al., Gene. 91:151 (1990); 
Ballas et al., Nucleic Acids Res.. 17:7891 (1989); Joshi et al., Nucleic Acid Res.. 
15:9627 (1987). 

The regulatory sequence can be a nucleic acid sequence located upstream 
(5' non-coding sequences), within, or downstream (3' non-coding sequences) of 

10 a coding sequence, and which influences the transcription, RNA processing or 
stability, or translation of the associated coding sequence. Regulatory sequences 
can include, but are not limited to, enhancers, promoters, repressor binding sites, 
translation leader sequences, introns, and polyadenylation signal sequences. 
They may include natural and synthetic sequences as well as sequences which 

15 may be a combination of synthetic and natural sequences. While regulatory 
sequences are not limited to promoters, some useful regulatory sequences 
include constitutive promoters, inducible promoters, regulated promoters, tissue- 
specific promoters, viral promoters and synthetic promoters. 

A promoter is a nucleotide sequence that controls expression of the 

20 coding sequence by providing the recognition for RNA polymerase and other 
factors required for proper transcription. A promoter includes a minimal 
promoter, consisting only of all basal elements needed for transcription 
initiation, such as a TATA-box and/or initiator that is a short DNA sequence 
comprised of a TATA- box and other sequences that serve to specify the site of 

25 transcription initiation, to which regulatory elements are added for control of 
expression. A promoter may be inducible. Several inducible promoters have 
been reported (Current Opinion in Biotechnology . 7:168 (1996)). Examples 
include the tetracycline repressor system, Lac repressor system, copper-inducible 
systems, salicylate-inducible systems (such as the PRla system). Also included 

30 are the benzene sulphonamide- (U.S. 5364,780) and alcohol-(WO 97/06269 and 
WO 97/06268) inducible systems and glutathione S-transferase promoters. In 
the case of a multicellular organism, the promoter can also be specific to a 
particular tissue or organ or stage of development. 
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An enhancer is a DNA sequence which can stimulate promoter activity 
and may be ah innate element of the promoter or a heterologous element inserted 
to enhance the level or tissue specificity of a promoter. It is capable of operating 
in both orientations (normal or flipped), and is capable of functioning even when 
5 moved either upstream or downstream from the promoter. Both enhancers and 
other upstream promoter elements bind sequence-specific DNA-binding proteins 
that mediate their effects. 

The expression cassette can contain a 5' non-coding sequence which is a 
nucleotide sequence located 5' (upstream) to the coding sequence. It is present in 
1 0 the fully processed mRNA upstream of the initiation codon and may affect 
processing of the primary transcript to mRNA, stability of the mRNA, or 
translation efficiency (Turner et al., Molecular Biotechnology. 3:225 (1995)). 

The expression cassette may also contain a 3' non-coding sequence 
which is a nucleotide sequence located 3' (downstream) to a coding sequence 
1 5 and includes polyadenylation signal sequences and other sequences encoding 
regulatory signals capable of affecting mRNA processing or gene expression. 
The polyadenylation signal is usually characterized by affecting the addition of 
polyadenylic acid tracts to the 3' end of the mRNA precursor. 

The invention also provides a construct containing a vector and an 
20 expression cassette. The vector may be selected from, but not limited to, any 
vector previously described. Into this vector may be inserted an expression 
cassette through methods known in the art and previously described (Sambrook et 
al., 1989). In one embodiment, the regulatory sequences of the expression cassette 
may be derived from a source other than the vector into which the expression 
25 cassette is inserted. In another embodiment, a construct containing a vector and an 
expression cassette is formed upon insertion of a nucleic acid segment of the 
invention into a vector that itself contains regulatory sequences. Thus, an 
expression cassette is formed upon insertion of the nucleic acid segment into the 
vector. Vectors containing regulatory sequences are available commercially and 
30 methods for their use are known in the art (Clonetech, Promega, Stratagene). 

The expression cassette, or a vector construct containing the expression 
cassette may be inserted into a cell. The expression cassette or vector construct 
maybe carried eposomally or integrated into the genome of the cell. 
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A variety of techniques are available and known to those skilled in the art 
for introduction of constructs into a cellular host. Transformation of bacteria and 
many eukaryotic cells may be accomplished through use of polyethylene glycol, 
calcium chloride, viral infection, phage infection, electroporation and other 
5 methods known in the art. Other transformation methods are available to those 
skilled in the art, such as direct uptake of foreign DNA constructs (see EP 
295959), techniques of electroporation (Fromm et al. Nature (London), 319:791 
(1986) or high velocity ballistic bombardment with metal particles coated with 
the nucleic acid constructs (Kline et al. Nature (London) 327:70 (1987), and U.S. 

10 Patent No. 4,945,050). 

The selection of an appropriate expression vector will depend upon the 
method of introducing the expression vector into host cells. Typically an 
expression vector contains (1) prokaryotic DNA elements coding for a bacterial 
origin of replication and an antibiotic resistance gene to provide for the 

15 amplification and selection of the expression vector in a bacterial host; (2) DNA 
elements that control initiation of transcription, such as a promoter; (3) DNA 
elements that control the processing of transcripts, such as introns, transcription 
termination/polyadenylation sequence; and (4) a reporter gene that is operatively 
linked to the DNA elements to control transcription initiation. Useful reporter 

20 genes include /3-galactosidase, chloramphenicol acetyl transferase, luciferase, 
green fluorescent protein (GFP) and the like. 

V. Methods to synthesize glucoseH3(L4VN-acetvlglucosamine moieties: N- 
acetvlgalactosamine^i8fL4)-N-acetvlducosamine moieties: N- 

25 acetvlgalact osamTne-/9 (l,4)-glucose moieties: N-acetvlglucosamine^j8(L4)-N" 
acetvlglucosamine moieties: mannose-i8(L4)-N-acetvlglucosamine moieties: 
and galactose-ffd ,4)-N-acetylglucosamine-6-SO^moieties. 

Catalytic domains of the invention having altered donor and acceptor 
specificity can be used to catalyze the linkage of numerous sugars from a donor 

30 to numerous acceptor sugars. Linkage of sugar derivatives can also achieved 
through use of the altered catalytic domains of the invention due to then- 
expanded donor and acceptor specificity. 

For example, the catalytic domains of section LA can be used to catalyze 
the linkage of glucose and N-acetylglucosamine; the catalytic domains of section 

as 
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IB can be used to catalyze the linkage of N-acetylgalactosamine and N- 
acetylglucosamine; many of the catalytic domains described herein can be used 
in association with a-lactalbumin to catalyze linkage of a sugar to glucose, as 
described in section IC; the catalytic domains of section ID can be used to 
5 catalyze the linkage of N-acetylglucosamine and N-acetylglucosamine, N- 
acetylgalactosamine to N-acetylglucosamine, and mannose to N- 
acetylglucosamine; and the catalytic domains of section IE can be used to 
catalyze the linkage of a donor and an acceptor having a bulky side-group, such 
as linking galactose to N-acetylglucosamine-6-S0 3 . 

10 Acceptors may be free in solution or linked to another molecule. For 

example, an acceptor may be linked to a protein, another sugar, a sugar 
derivative, and the like. An acceptor may also be linked to a solid support that 
provides a platform to which donors may be added sequentially to fonn 
oligosaccharides, and derivatives thereof, having a specified sequence. 

1 5 Generally, the linkage between a donor and an acceptor is accomplished 

by incubating a catalytic domain of the invention with a desired donor and a 
desired acceptor under conditions of appropriate temperature, pH, and divalent 
metal concentration to allow linkage of the donor to the acceptor. 

For example, the galactose and N-acetylgalactosyltransferase activity of 

20 0(1 ,4)-galactosyltransferase I can be determined using the following assay 

conditions. A 100 /il incubation mixture containing 50 mM jS-benzyl-GlcNAc, 
10 mM MnCl 2 , 10 mM Tris-HCl (pH 8.0), 500 nM UDP-Gal or UDP-GalNAc, 
and 20 ng of /?(l,4)-galactosyltransferase I can be incubated at 37°C for 10 
minutes to promote coupling of a donor sugar to an acceptor sugar. While these 

25 conditions are provided as an example, it is understood that many other 
conditions maybe used to chemically link a donor to an acceptor using the 
altered catalytic domains of the invention. 



VI. Methods to prepare oligosaccharides. 
30 The invention provides methods to synthesize oligosaccharides, 

especially oligosaccharides having preselected sequences though use of the 
altered catalytic domains of the invention. Generally, the methods involve the 
sequential addition of a sugar, or derivative thereof, to the end of a growing 



WO 2004/063344 



PCT/US2004/000470 



oligosaccharide chain. Such methods have been described using enzymes other 
than those of the invention (U.S. Patent No: 6,284,493). 

Briefly, a donor and an acceptor may be incubated with an altered 
catalytic domain of the invention under conditions that allow the donor to be 
5 linked to the acceptor. These conditions are described in the examples section 
herein. 

In one example, the donor and the acceptor may be combined with a 
catalytic domain of the invention in solution. The solution is then incubated to 
allow the donor to be linked to the acceptor. The newly linked molecule may be 
10 isolated and then added to a second solution containing a second donor and a 
second transferase enzyme. This cycle may be repeated with a specific donor 
added at each cycle such that an oligosaccharide having a specific sequence is 
produced. 

In another example, an acceptor may be linked to a solid support. The 

1 5 solid support may then be immobilized in a structure such as a column or a tray. 
A donor and a catalytic domain of the invention can then be incubated with the 
immobilized acceptor under conditions that allow the donor to be linked to the 
acceptor. The solid support is then washed to remove any unlinked donor and 
catalytic domain present. A second donor and a second transferase enzyme can 

20 then be added and incubated under conditions that allow the donor to be linked 
to the acceptor. This cycle can be repeated to allow for the rapid and large-scale 
production of oligosaccharides having defined sequences. In addition, this 
method may be readily adapted for use in an automated system. This system 
may be used without the need for protecting groups on the acceptor or the donor 

25 due to the use of enzymes that catalyze the linkage of a given donor to a given 
acceptor. Accordingly, an advantage of this method is that mild reaction 
conditions may be used that do not damage the growing oligosaccharide chain. 
Another advantage of the method is the short cycling time required to add 
monomers onto the growing oligosaccharide chain due to the lack of a need to 

30 protect and deprotect the growing oligosaccharide chain. 

Other methods for using the catalytic domains of the invention to 
synthesize sequences of predetermined oligosaccharides and derivatives thereof 
may also be used. However, these methods will utilize an galactosyltransferase 



WO 2004/063344 



PCT/US2004/000470 



having an altered acceptor site, an altered donor site, or altered donor and 
acceptor sites. 

VII. Methods to increase the immunogenicitv of an antigen. 
5 The invention provides methods to increase the immunogenicity of an 

antigen. Generally, the methods involve incubating an antigen with a catalytic 
domain of the invention such that a sugar is transferred from a donor to an 
acceptor through the action of a catalytic domain. 

The methods of the invention may be used in association with nearly any 

1 0 acceptor containing material against which an immune response is desired. For 
example, sugars may be transferred from a donor to an acceptor that is linked to 
a whole cell. The cell can then be killed through irradiation or chemical means 
and administered to an animal to elicit an immune response. Cell membranes 
may be used in a similar manner. Methods to create an immune response against 

15 cells and cell membranes are described in U.S. Patent No. 6,361,775. 

The immunogenicity of a virus or subunit thereof may be increased 
according to the methods of the invention and used as an improved vaccine. For 
example, for a virus that contains a glycoprotein as a component of the virion, 
one or more sugars may be added to the glycoprotein by propagation of the virus 

20 in a cell that expresses a catalytic domain of the invention. Alternatively, one . or 
more sugars may be directly added to the glycoprotein using a catalytic domain 
of the invention. Furthermore, there exist viruses without envelopes that contain 
complex carbohydrates. Sugars may be added onto these carbohydrates through 
use of a catalytic domain of the invention. 

25 Viral subunits may be obtained from virions using biochemical methods 

or they can be expressed by recombinant means in suitable eukaryotic cells. 
Methods of expressing viral subunits are common in the art. These methods 
may vary according to the type of virus used. For example, methods of 
expressing viral subunits are described in the following articles and in the 

30 references cited therein: Possee, Virus Research, 5:43 (1986); Kuroda et al., 
EMBO J.. 5: 1359 (1986); Doerfler, Curr. Tonics Microbiol. Immunol.. 131:51 
(1986); Rigby, J. Gen. Virol., 64:255 (1983); Mackett et al., In: DNA Cloning, A 
Practical Approach, Vol II, Ed. D. M. Glover, IRL Press, Washington, D.C. 
(1985); Rothestein, In: DNA Cloning, A Practical Approach, Supra (1985); 
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Kinney et al., J. Gen. Virol.. 69:3005 (1988); Panical et al., Proc. Natl. Acad. 
ScL 80:5364 (1983); Small et al., In: Vaccinia Viruses as Vectors for Vaccine 
Antigens, pp. 175-178, Ed. J. Quinnan, Elsevier, N.Y (1985). 

Viruses for which vaccines are currently available and whose 
5 immunogenicity can be improved according to the methods of the invention 
include influenza virus (Orthomyxovirus); rabies virus (Rhabdovirus); hepatitis 
B virus (Hepadnavirus); eastern, western and Venezuelan equine encephalitis 
virus (Togavirus/Alphavirus genus); and, Japanese encephalitis virus, tick-borne 
encephalitis virus and Russian spring-summer encephalitis virus (Flavivirus); 

10 Rift Valley fever virus (Bunyavirus)(reviewed: Melnick, In: High Technology 
Route to Virus Vaccines Ed. Dreesman, Bronson and Kennedy, American 
Society for Microbiology, Washington, D.C. (1985); Ogra et al., Prog. Med. 
Virol.. 37:156 (1990)). Viruses for which vaccines are not yet commercially 
available but which may also be useful when treated according to the methods of 

15 the invention include, but are not limited to human immunodeficiency and 

human T-cell leukemia viruses (Retrovirus); respiratory syncytial virus and other 
paramyxoviruses (Paramyxovirus); herpes simplex viruses types 1 and 2, 
varicella zoster virus, cytomegalovirus and other herpes viruses (Herpesviruses); 
dengue virus and Saint Louis encephalitis virus (Flavivirus); hantaan virus 

20 (Bunyavirus); Lassa virus (Arenavirus); and, rotavirus (Reovirus). In addition, 
there exist viral vaccines comprising live attenuated viruses the administration of 
which to humans is associated with some measure of risk of mild to severe side 
effects. It is now possible according to the methods of the invention to enhance 
the immunogenicity of killed virus vaccines which may serve to reduce the use 

25 of their live, more risk-associated counterparts. Thus, vaccines which currently 
comprise live virus, such as measles virus, mumps virus, rubellavirus etc. are all 
encompassed by the invention. 

Vaccines are prepared by suspension of a suitable concentration of an 
antigen that has been linked to a sugar, through the action of a catalytic domain 

30 of the invention, in a pharmaceutically acceptable carrier. The composition of 
the carrier will depend upon the type of vaccine and the route of administration 
and will be readily apparent to one skilled in the art. The vaccine may be 
administered in a dose of 10 2 to 10 9 cells per dose (in the case of whole cells) or 
a similar equivalent of cell membranes. In the case of viral vaccines, virus may 

9f\ 
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be administered in dose ranging from 1 fig to 50 mg of virus per dose, or a 
similar equivalent of subunit. Determination of the appropriate dosage of 
vaccine will be apparent to one of skill in the art and will depend upon the 
antigens comprising the vaccine, the age of the patient and their general and 
5 immunological health. 

In order to improve the efficacy of vaccines prepared according to the 
methods of the invention, patients may be pretreated with adjuvant at the site of 
vaccination several days prior to administration of vaccine. Pretreatment with 
adjuvant serves to induce migration of macrophages to the site of inoculation, 

10 thereby enhancing the rate of phagocytosis of treated antigens. In the case of 
viral vaccines, patients may be treated with adjuvant and vaccine 
simultaneously. Adjuvants suitable for this purpose include aluminum 
hydroxide and like adjuvants. 

Vaccines are administered to a mammal, particularly a human, either 

15 subcutaneously, intramuscularly, orally, intravenously, intradermally, 

intranasally or intravaginally. Prior to oral administration, the vaccine can be 
mixed with a solution containing.a sufficient amount of sodium bicarbonate or 
other suitable compound capable of neutralizing stomach acid (approximately 2 
grams). Alternatively, the vaccine usually in lyophilized form, can be 

20 formulated as tablets which are treated with a coating capable of resisting 
stomach acid. 
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Table I 



Exemplary Amino Acid Sequences 



SEQ 

ID 

NO 


Accession 
Number 


Description 


Amino Acid Sequence 


3 


BAA06188 


Human 

0(1, ^galactosyl- 
transferase 
(398 AA) 


MRLREPLLSRSAAMPGASLQRACRLLVAVCA 

LHLGVTLVYYLAGRDLSRLPQLVGVSTPLQG 

GSNSAAAIGQSSGDLRTGGARPPPPLGASSQP 

RPGGDSSPVVDSGPGPASNLTSVPVPHTTALS 

LPACPEESPLLVGPMLIEFNMPVDLELVAKQN 

PNVmGGRYAPRDCVSPHKVAmPFRNRQEH 

LKYWLYYLHPVLQRQQLDYGIYVINQAGDTI 

FNRAKLLNVGFQEALKDYDYTCFVFSDVDLI 

PMNDHNAYRCFSQPRffiSVAMDKFGFSLPYV 

QYFGGVSASSKQQFLTINGFPNNYWGWGGE 

DDDIFNRLVFRGMSISRPNAVVGTCRMIRHSR 

DKKNEPNPQRFDRIAHTKETMLSDGLNSLTY 

QVLDVQRYPLYTQITVDIGTPS 


4 


A33396 


Mouse 

j8(l, ^galactosyl- 
transferase 
(399 AA) 


MRFREQFLGGSAAMPGATLQRACRLLVAVC 

ALHLGVTLVYYLSGRDLSRLPQLVGVSSTLQ 

GGTNGAAASKQPPGEQRPRGARPPPPLGVSP 

KPRPGLDSSPGAASGPGLKSNLSSLPVPTTTG 

LLSLPACPEESPLLVGPMLIDFNIAVDLELLAK 

KNPEIKTGGRYSPKDCVSPHKVAmPFRNRQE 

HLKYWLYYLHPILQRQQLDYGIYVINQAGDT 

MFNRAKLLNIGFQEALKDYDYNCFVFSDVDL 

IPMDDRNAYRCFSQPRHISVAMDKFGFSLPY 

VQYFGGVSALSKQQFLAINGFPNNYWGWGG 

EDDDIFNRLVIIKGMSISRPNAVVGRCRMIRH 

SRDKKNEPNPQRFDRIAHTBCETMRFDGLNSL 

TYKVLDVQRYPLYTQITVDIGTPR 


5 


S05018 


Bovine 

jS(l ^galactosyl- 
transferase 
(402 AA) 


MKFREPLLGGSAA3VIPGASLQRACRLLVAVC 

ALHLGVTLVYYLAGRDLRRLPQLVGVHPPLQ 

GSSHGAAAIGQPSGELRLRGVAPPPPLQNSSK 

PRSRAPSNLDAYSHPGPGPGPGSNLTSAPVPS 

TTTRSLTACPEESPLLVGPMLIEFNIPVDLKLIE 

QQNPKVKLGGRYTPMDCISPHKVAniLFRNR 

QEHLKYWLYYLHPMVQRQQLDYGIYVINQA 

GESMFNRAKLLNVGFKEALKDYDYNCFVFS 

DVDLIPMNDHNTYRCFSQPRfflSVAMDKFGF 

SLPYVQYFGGVSALSKQQFLSINGFPNNYWG 

WGGEDDDIYNRLAFRGMSVSRPNAVIGKCR 

MIRHSRDKKNEPNPQRFDRIAHTKETMLSDG 

LNSLTYMVLEVQRYPLYTKITVDIGTPS 



3\ 
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6 




Human 

mm i <>vix 

Stem Region.of 
j8(l, ^galactosyl- 
transferase 


RDT SRLPOLVGVSTPT OGGSNSAAAIGOSSGD 
LRTGGARPPPPLGASSQPRPGGDSSPWDSGP 
GPASNLTSVPVPHTTALSLPACPEESPLLVGP 
MLIEFNMPVDLELVAKQ 


•7 




Bovine 

Stem Region of 
^(^^galactosyl- 
transferase 


RDLRRLPOLVGVWPLOGSSHGAAAIGOPSG 

XVX-'X-/X\XVXj.I V^JL/ V VJ V X IX ± X^\£ VJkJOX XXwI/^^VrvJLVJ V(X O VJ 

ELPvLRGVAPPPPLQNSSKPRSRAPSNLDAYSH 
PGPGPGPGSNLTSAPVPSTTTR 


8 




Human Catalytic 
Domain of 
0(1, ^galactosyl- 
transferase 


SLPACPEESPLLVGPMLIEFNMPVDLELVAKQ 
NPNVKMGGRYAPRDCVSPHKVAmPFRNRQE 
HLKYWLYYLHPVLQRQQLDYGIYVINQAGD 
TIFNRAKLLNVGFQEALKDYDYTCFVFSDVD 
LPMNDHNAYRCFSQPRHISVAMDKFGFSLP 
i v i ruu v o AoaK^fL iiiNurrlNlN I Wuvvu 
GEDDDIF]Sn^VFRGMSK 
HSRDxsJCNEPNPQRTO^ 
LTYQVLDVQRYPLYTQITVDIGTPS 


9 




Bovine Catalytic 
Domain of 
j8(l, ^galactosyl- 
transferase 


SLTACPEESPLLVGPMLIEFNIPVDLKLffiQQN 

PKVKLGGRYTPlVlDCISPl^VAniLFRx^ 

LKYWLYYLOTMVQRQQLDYGIYVINQAGES 

MFNRAKLL1WGFKEAL1<^ 

LIPIVlNDHNTYR 

VOYFGGVSALSKOOFLSli^GFPNNTYWGWGG 
EDDDIYMU^FRGMSVSRPNAVIGKCRMIRH 
SRD1<^EPNPQRFDR1A1T1^TMLSDGLNSL 
TYMVLEVQRYPLYTKITVDIGTPS 



3P- 



WO 2004/063344 



PCTAJS2004/000470 



Table n 



Exemplary Nucleic Acid Sequence 



SEQ 


Accession 


Description 


Nucleic Acid Sequence 


ID 


Number 






NO 








10 


D29805 


Human 

0(1 ^galactosyl- 
transferase 


ATGAGGCTTCGGGAGCCGCTCCTGAGCCGGA 

GCGCCGCGATGCCAGGCGCGTCCCTACAGCG 

GGCCTGCCGCCTGCTCGTGGCCGTCTGCGCTC 

TGCACCTTGGCGTCACCCTCGTTTACTACCTG 

GCTGGCCGCGACCTGAGCCGCCTGCCCCAAC 

TGGTCGGAGTCTCCACACCGCTGCAGGGCGG 

GTCGAACAGTGCCGCCGCCATCGGGCAGTCC 

TCCGGGGACCTCCGGACCGGAGGGGCCCGGC 

CGCCGCCTCCTCTAGGCGCCTCCTCCCAGCCG 

CGCCCGGGTGGCGACTCCAGCCCAGTCGTGG 

ATTCTGGCCCTGGCCCCGCTAGCAACTTGACC 

TCGGTCCCAGTGCCCCACACCACCGCACTGTC 

GCTGCCCGCCTGCCCTGAGGAGTCCCCGCTG 

CTTGTGGGCCCCATGCTGATTGAGTTTAACAT 

GCCTGTGGACCTGGAGCTCGTGGCAAAGCAG 

AACCCAAATGTGAAGATGGGCGGCCGCTATG 

CCCCCAGGGACTGCGTCTCTCCTCACAAGGT 

GGCCATCATCATTCCATTCCGCAACCGGCAG 

GAGCACCTCAAGTACTGGCTATATTATTTGCA 

CCCAGTCCTGCAGCGCCAGCAGCTGGACTAT 

GGCATCTATGTTATCAACCAGGCGGGAGACA 

CTATATTCAATCGTGCTAAGCTCCTCAATGTT 

GGCTTTCAAGAAGCCTTGAAGGACTATGACT 

ACACCTGCTTTGTGTTTAGTGACGTGGACCTC 

ATTCCAATGAATGATCATAATGCGTACAGGT 

GTTTTTCACAGCCACGGCACATTTCCGTTGCA 

ATGGATAAGTTTGGATTCAGCCTACCTTATGT 

TCAGTATTTTGGAGGTGTCTCTGCTTCAAGTA 

AACAACAGTTTCTAACCATCAATGGATTTCCT 

AATAATTATTGGGGCTGGGGAGGAGAAGATG 

ATGACATTTTTAACAGATTAGIlll'lAGAGGC 

ATGTCTATATCTCGCCCAAATGCTGTGGTCGG 

GACGTGTCGCATGATCCGCCACTCAAGAGAC 

AAGAAAAATGAACCCAATCCTCAGAGGTTTG 

ACCGAATTGCACACACAAAGGAGACAATGCT 

CTCTGATGGTTTGAACTCACTCACCTACCAGG 

TGCTGGATGTACAGAGATACCCATTGTATAC 

CCAAATCACAGTGGACATCGGGACACCGAGC 

TAG 
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Example I 

General Expression, Mutagenesis, Folding, and Purification of 
Catalytic Domains of Bovine j8(l,4)-gaIactosyltransf erase I 

5 Materials and Methods 

Bacterial growth and plasmid transformations were performed using 
standard procedures (Ausubel et al., Current Protocols in Molecular Biology, 
Greene Publishing Associates and Wiley-Interscience, New York (1987)). The 
plasmid pEGT-dl29, which encodes the catalytic domain (residues 130-402) of 

1 0 bovine j3(l ,4)-galactosyltransferase I, was used for mutation. Site-directed 
mutagenesis was performed using a CLONTECH site-directed mutagenesis 
transformer kit. The transformation mixture contained the template pEGT-dl29, 
a selection primer, and a mutagenic primer for creation of a desired mutant. 
Mutants were screened for the incorporated mutations by looking for changes in 

1 5 restriction enzyme digestion patterns and confirmed by DNA sequencing. The 
positive clones were transformed into B834(DE3)pLysS cells. The primers were 
synthesized by the Molecular Technology Laboratory, NCI-Frederick. 

The expression and purification of the inclusion bodies were carried out 
as described previously (Ausubel et al., Current Protocols in Molecular Biology, 

20 Greene Publishing Associates and Wiley-Interscience, New York (1987)). The 
inclusion bodies were ^-sulfonated by dissolving in 5 M GdnHCl, 0.3 M sodium 
sulfite, and the addition of di-sodium 2-nitro-5-thiosulfobenzoateto a final 
concentration of 5 mM. The sulfonated protein was precipitated by dilution with 
water, and the precipitate was washed thoroughly. 

25 The sulfonated protein was re-dissolved in 5 M GdnHCl to a protein 

concentration of 1 mg/ml (1 .9-2.0 optical density at 275 nm). The protein 
solution was diluted 10-fold, in 10 portions, in a folding solution to give a final 
concentration of 100 ^g/ml /3(l,4)-galactosyltransferase 1, 0.5 M GdnHCl, 
50 mM Tris-HCl, 5 mM EDTA, 4 mM cysteamine, and 2 mM cystamine, pH 

30 8.0 at 4°C. It was left at 4 °C for 48 h to allow the protein to fold and then 

dialyzed against 3x4 liters of water containing 50 mM Tris-HCl, pH 8.0, 5 mM 
EDTA, 4 mM cysteamine, and 2 mM cystamine at 4°C to remove GdnHCl. Any 
protein that precipitated during dialysis was removed by centrifugation, and the 
supernatant was concentrated. Typically, when 100 mg of sulfonated protein was 

3^- 
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folded in a 1-liter folding solution, it yields 2-5 and 10-12 mg of the active, 
soluble, and pure wild-type j9(l,4)-galactosyltransferase I catalytic domain, or 
mutant thereof. The folded proteins were purified further on an LA-agarose 
column (Sigma). The final, purified protein had a specific activity that was 
5 slightly higher than that of purified milk 0(1 ,4)-galactosyltransferase I. 

Site-directed Mutagenesis of amino acid position 289 of the bovine 
0(l>4)-galactosyltransferase I: Site-directed mutagenesis was performed using 
the PCR method. Construction of the mutants was done using plasmid pEGT- 
dl29 as the template; this contains a.BamHl/EcdRl fragment inserted into 
10 pEGT23a vector, coding for the residues 130-402 of bovine j8(l,4)- 
galactosyltransferase I, and has a Cys-342 to Thr mutation. 

The mutation primers corresponding to the upper DNA strand are: 
Y289L, 

5'-CCTTACGTGCAATTGTTTGGAGGTGTCTCTGCTCTAAGTAAA-3'(SE 
15 Q ID NO: 11) and 

5 '-GACACCTCCAAACAATTGCACGTAAGGTAGGCTAAA-3 ' (SEQ ID 
NO: 12); Y289I, 

5 '-CTACCTTACGTGCAGATCTTTGGAGGTGTCTCTGCTCTAAG-3 ' (SEQ 
ID NO: 13) and 

20 5 '-GACACCTCCAAAGATCTGCACGTAAGGTAGGCTAATCCAA-3 ' (SEQ 
ID NO: 14);Y289N, 

5 '-G<}ATTAGCCTACCATATGTGCAGAATTTTGGAGGTGTCTCT-3 ' (SEQ 
ID NO: 15) and 

5 '-AGAGACACCTCCAAAATTCTGCACATCTGGTAGGCTAAATCC-3 ' 
25 (SEQ ID NO: 16). Typically, the entire Gal-Tl DNA was PCR-amplified as two 
fragments using the terminal cloning primers and two mutagenesis primers. The 
fragments were then cut with the restriction enzymes Mfel, BglR, and Ndel for 
Y289L, Y286I, and Y286N mutants, respectively, and ligated. The full Gal-Tl 
DNA with the mutation was amplified from the ligation mixture using the 
30 cloning primers and then inserted into the pET28a vector. Mutants were 

screened for the incorporated mutations, based on alterations in the restriction 
enzyme digestion patterns, and then sequenced. The positive clones'were 
transformed into B834(DES)pLys8 cells as described previously (Ramakrishnan 
et al., J. Biol. Chem .. 270. 87665-376717 (2001)). The mutant proteins were 
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expressed and purified according to the published method (Ramakrishnan et al., 
J. Biol Chem.. 270, 87665-376717 (2001)). 

Gal-T and GalNAc-T Enzyme Assays: The protein concentrations 
were measured using the Bio-Rad protein assay kit, based on the method of 
5 Bradford and further verified on SDS gel. An in vitro assay procedure for the 
Gal-Tl has been reported previously (Ramakrishnan et al., J. Biol. Chem.. 270, 
87665-376717 (2001)). The activities were measured using UDP-Gal or 
UDP-GalNAc as sugar nucleotide donors, and GlcNAc and Glc as the acceptor 
sugars. For the specific activity measurements, a 100-//1 incubation mixture 

1 0 containing 50 mM /?-benzyl-GlcNAc, 1 0 raM MnCl 2 , 1 0 mM Tris-HCl, pH 8.0, 
500//M UDP-Gal or UDP-GalNAc, 20 ng of Gal-Tl, and 0.5 juCl of [ 3 H]UDP- 
Gal or [ 3 H]UDP-GalNAc was used for each Gal-T or GalNAc-T reaction. The 
incubation was carried out at 37 °C for 10 min. The reaction was terminated by 
adding 200 /A of cold 50 mM EDTA, and the mixture was passed through a 0.5- 

15 ml bed volume column of AG1-X8 cation resin (Bio-Rad) to remove any 
unreacted [ 3 H]UDP-Gal or [ 3 H]UDP-GalNAc. The column was washed 
successfully with 300, 400, and 500 //l of water, and the column flow-through 
was diluted with Biosafe scintillation fluid; radioactivity was measured with a 
Beckman counter. A reaction without the acceptor sugar was used as a control. 

20 A similar assay was carried out to measure the GalNAc-T activity with Glc and 
other acceptors in the presence of 50 jum bovine LA (Sigma). 

Studies for Determining the Kinetic Constants: The true K m of the 
donor (K A ) and of the acceptor {K B \ the dissociation constant of the donor, K i(a)9 
and k ca t were obtained using two-substrate analyses and the primary plots of five 

25 concentrations of donor (UDP-Gal or UDP-GalNAc) and five concentrations of 
acceptor, and the corresponding secondary plots of the intercepts and slopes. 
Initial rate conditions were linear with respect to time. A suitable range of donor 
and acceptor concentrations, were chosen, which allowed an accurate Michaelis- 
Menten plot to be derived. The data were also analyzed for a general two- 

30 substrate system using the following equations ( Zhang et al., Glvcobiologv, 

9:815-822 (1999)) with the software EnzFitter, a Biosoft nonlinear curve-fitting 
program for Windows. 
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Equation 1 

K lia) K B +K A [A]+K B [B]+[4.B] 
Equation 2 

k a [a]+k b [b]+[aIb] 

5 

Here v is the initial velocity and the rate equation for sequential symmetrical 
initial velocity pattern associated with Equation 1 , an ordered or random 
equilibrium mechanism in which substrate A dissociates well from the E-S 
complex with a dissociation constant of K { ( a y Equation 2 is for asymmetric 

10 initial velocity pattern for a double displacement or "ping-pong" mechanism. 
The kinetic parameters K A , K B , and V mui , were obtained from the fitted 
curves using the above rate equations. The graphical method and EnzFitter 
program gave very similar kinetic parameters. In the N-acetylgalactosamine 
transferase (GalNAc-T ) assay, the maximum substrate concentrations used for 

15 UDP-GalNAc and GlcNAc were 1 and 200 mM, respectively. However, in the 
galactose transferase (Gal-T) assay, because of the limited solubility of GlcNAc 
in water, the concentration of GlcNAc was limited to no more than 400 mM 
(which is 2-fold higher than its K,„ value), whereas up to 300 jM UDP-Gal were 
used. 

20 *H NMR Spectroscopy of the Products of GalNAc-T Activity: The 

reaction was carried out in a total volume of 1 ml that contained 100 //g of 
Y289L mutant plus 10 mM each triethanolamine-HCl, pH 8.0, UDP-GalNAc, 
GlcNAc, and MnCl 2 at 37 °C for 48 h. The mixture was first passed through a 
1-ml Chelex 100 column and then through a 1-ml cationic column containing 
25 AG1-X8 resin (200-400-mesh). The disaccharide product in the flow-through 
from 4 bed volumes was pooled and freeze-dried. The product was finally 
dissolved in 400 fA of D 2 0, and its ] H NMR spectrum was obtained in a 400- 
MHz NMR spectrometer. All the NMR spectra were recorded in the Analytical 
Chemistry Laboratory, NCI-Frederick (Frederick, MD). 

30 

Results 
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Comparison of Gal-T and GalNAc-T Catalytic Activities of Wild- 
type and Gal-Tl Mutants: The mutants Y289L, Y289N, and Y289I exhibited 
both galactose transferase and N-acetylgalactosyltransferase activities (Table EH 
and Fig. 1). The mutants Y289L and Y289N exhibited equally strong galactose 
5 transferase and N-acetylgalactosyltransferase activities. The specific N- 
acetylgalactosyltransferase activity of the mutant Y289I at the substrate 
concentration used was half that of Y289L (Table IH). The Y289I mutant 
exhibited a slightly weaker affinity to UDP-GalNAc (Fig. 1). The Asn 
substitution exhibited GalNAc-T activity as well as the Leu substitution. Wild- 

10 type-Gal-Tl exhibits a very low glucosyltranferase activity, but no N- 

acetylglucosaminyltransferase activity (Berliner and Robinson, Biochemistry. 
21:6840-68433 (1982); Palmin and Hindaganl, Glvcobiologv. 1:205-209 (1991); 
Ramakrishnan et al., J. Biol. Chem.. 270:87665-37671 (2001)). On the contrary, 
the mutants exhibited reasonable iV-acetylglucosaminyltransferase activity 

1 5 (Table IH) where they transfer i^-acetylglucosamine from TJDP-iV- 

acetylglucosamine to the acceptor W-acetylglucosamine but do not exhibit 
glucosyltranferase activity. In this A^-acetylglucosaminyltransferase activity the 
initial product, the disaccharide GlcNAc ,01,4- GlcNAc, itself is an acceptor for 
the enzyme, thus producing tri- and longer chain saccharides. 

20 
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Table HI 

Specific activities of the catalytic domain of Gal-Tl (residues 130-402) with the 
C342T mutation and the Tvr-289 mutants 

Enzyme UDP-galactose -> GlcNAc UDP-GalNAc GlcNAc UDP-GlcNAc ->• GlcNAc 



Gal-T activity GalNAc-T activity GlcNAc-T activity 

pmol/min/ng pmol/min/ng pmol/min/ng 

C342T H.8 0.1 0 

Y289L_C342T 16.2 , 27.3 8.2 

Y289I_C342T 8.8 14.7 6.2 



Reactions were performed under saturating conditions of all substrates 
5 (assay conditions are described herein). In these reactions, the concentration for 
the donors was 500//M and the acceptor /?-benzyl-GlcNAc was 25 mM. In the 
Gal-T reaction of C342T, the acceptor concentration was 10 mM, because at 25 
mM it showed inhibition. Because Y289N rapidly undergoes denaturization, 
reliable data could not be obtained. 

10 

Because the Y289L variant of Gal-Tl exhibits both Gal-T and GalNAc-T 
enzyme activities with equal efficiencies, double substrate kinetic studies were 
carried out to determine the kinetic constants for both donor and acceptor 
molecules. The kinetic data from both reactions fit best to Equation 2, with a 

1 5 zero Ki(„) value, describing an asymmetric initial velocity pattern for a double- 
displacement or "ping-pong" mechanism. The Y289L mutant in the Gal-T 
reaction showed nearly a 20-fold higher K m for the acceptor than the wild-type 
Gal-Tl (Table IV). The catalytic constant (£<*,) in the Gal-Tl reaction is 
comparable with the wild type, but it is nearly 3-5-fold higher in the GalNAc 

20 reaction (Table IV). 
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Table IV 

Kinetic parameters for the donor and the acceptor substrates b v Y289L C342T 
mutant in the Gal-T and the GalNAc-T catalytic reactions 



Mutant 


Gal-T UDP-galactose -> GlcNAc 


GalNAc-T UDP-W-acetylgalactosamine -» 
GlcNAc 




Ka 


Kb 


Kcat. 


K A 


Kb Kcat. 




JUM 


Mm 




jlM 


mM 


C342T 


98(6) 


11(1) 


14 


ND 


ND ND 


Y289LC342T 


75 (1) 


198(1) 


8.5 


854 (8) 


51 (1) 4.0 



ND, not determined 



5 Because the Y289L mutant exhibits equally high GalNAc-T activity as it 

does Gal-T activity, the disaccharide product from this reaction was purified and 
analyzed by l H NMR spectroscopy (Fig. 4). The NMR results demonstrated that 
the mutant Y289L transfers GalNAc from UDP-GalNAc to GlcNAc, forming a 
/ft -4 linkage between GalNAc and GlcNAc. 

10 A Point Mutation in the Codon for Amino Acid 289 Can Convert the 

Enzyme with Dual Property: In humans, the /?4Gal-T family has seven 
members (Gal-T 1 to -T7), each with high sequence identity within their catalytic 
domain (Lo et al., Glvcobioloev. 8:517-526 (1998); Nomura et al., J. Biol. 
Chem.. 273:13570-13577 (1998)). These family members transfer Gal from 

15 UDP-Gal to different sugar acceptors. For example, Gal-T6 transfers Gal to Glc 
of Glc-ceramide, whereas Gal-T7 transfers Gal to xylose (Nomura et al., J. Biol. 
Chem.. 273:13570-13577 (1998)). Among these seven members, four members, 
Gal-Tl to Gal-T4, have a Tyr residue at position 287 (which corresponds to Tyr- 
289 in bovine Gal-Tl), whereas Gal-T5 and Gal-T6 have a Phe residue. 

20 



Ad 



WO 2004/063344 



PCT/US2004/000470 



Table V 

Codon usage for the amino acid at position 289 among the Gal-Tl family 

members 





Codon 


Amino acid at position 289 


d ovine ijai-i i 


TAT* 

1A1 


Tyr 


Human Gal-Tl 


TAT 


Tyr 


Human Gal-T2 


TAC 


Tyr 


Human Gal-T3 


TAC 


Tyr 


Human Gal-T4 


TAT 


Tyr 


Human Gal-T5 


TTT 


Phe 


Human Gal-T6 


TTC 


Phe 


Human Gal-T7 


TAC 


Tyr 



5 



In human Gal-Tl, the corresponding amino acid is at position 287. The 
second and third nucleotides of the codon show variations among the family 
members, while strictly conserving the first nucleotide. Mutation of the first 
nucleotide of the codon (shown in bold) to either A or C would result in a Leu or 
10 Asn residue instead of Tyr, and such a mutant would exhibit dual enzymatic 
activities. 

Role of a-Lactalbumin (LA) in the GalNAc-T Activity of the wt-Gal- 
Tl and the Y289L Mutant: LA enhances the transfer of GalNAc from UDP- 

15 GalNAc to GlcNAc by Gal-Tl with -0.1% of its Gal-T efficiency, a situation 
similar to glucosyltransferase activity (Glc-T) of Gal-Tl, where LA increases 
this activity from 0.3 to 10% (Ramakrishnan et al., J. Biol. Chem.. 270:87665- 
37671 (2001)). LA plays a kinetics role in stimulating Gal-Tl to transfer Glc 
from the UDP-Glc to GlcNAc (Ramakrishnan et al., J. Biol. Chem.. 270:87665- 

20 37671 (2001); Ramakrishnan et al., Biochem Biophvs. Res. Commun.. 

291:1 1 13-1 1 18 (2002)). The activity is enhanced by approximately 1%. This 
level of activity was determined using the donor concentration of the unlabeled 
UDP-sugar, and only a small amount of 3 H-labeled UDP-sugar. Use of this 
protocol allowed for accurate calculation based on the amount of unlabeled 

25 UDP-sugar used. 

Unlike wt-Gal-Tl, where LA stimulates the transfer of GalNAc only to 
GlcNAc and not to Glc (Do et al., Biol. Chem.. 276:18447-18451 (1995)), the 

4| 
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Tyr-289 mutants (Y289L, Y289N, and Y289I) in the presence of LA transfer 
GalNAc preferably to Glc rather than to GlcNAc. This property is quite similar 
to LS activity in which the wt-Gal-Tl in the presence of LA transfers Gal to Glc 
instead to GlcNAc (Table VI). Furthermore, like wt-Gal-Tl, these mutants also 
5 transfer Gal to Glc in the presence of LA. For example, the 

A^-acetylgalactosaminyltransferase activity in the bovine mammary gland 
extracts with a similar catalytic property has been identified (Van den 
Nieuwenhof et al., FEBS Lett.. 459:377-380 (1999)). This enzyme also transfers 
GalNAc from UDP-GalNAc to GlcNAc in the absence of LA, whereas in the 
1 0 presence of LA it transfers GalNAc to Glc instead to GlcNAc. 





Table VI 


GalNAc-T catalvtic activity of Y289L mutant in the absence and the presence of 


LA on various acceptor substrates 


Acceptor 


GalNAc-T activity 




-LA +LA 




pmol/min/ng 


GlcNAc 


31.3 3.3 


Glc 


2.6 34.8 


Glucosamine 


1.8 5.0 



Assays were carries out at 37 °C using 500/M UDP-GalNAc for 10 minutes. In 
all the measurements, the acceptor concentration was 100 mM and the LA 
concentration was 55 //M. 



It is shown herein that the Tyr-289 mutants of Gal-Tl exhibit as high 
GalNAc-T activity as Gal-T activity, as well as GlcNAc-T activity. In the 
human Gal-T family members, the Tyr/Phe residue at 287 (or in bovine Tyr-289) 
is important for determining the donor sugar specificity of the enzyme. 
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Example II 

General Expression, Mutagenesis, Folding, and Purification of 
Catalytic Domains of 0(l,4)-galactosyltransferase I enzymes having an 

N-terminal stem region 

5 

Materials and Methods 
Restriction endonucleases and DNA-modifying enzymes were from New 
England Biolabs (Beverly, MA). Oligonucleotide primers were synthesized by 
the Recombinant DNA Facility at NCI-Frederick. The plasmid mini preparation 

10 kit was from Qiagen (Santa Clarita, CA). Ampicillin, UDP-Gal and UDP- 
agarose were from Sigma Chemical Co, MO. pET23a vector and BL21 
(XDE3)/pLysS competent cells were from Novagen (Madison, WT). E. coli 
XL2-Blue ultracompetent cells were from Stratagene (La Jolla, CA). AG1-X8 
resin, chloride form, 200-400 mesh was from Bio-Rad (Hercules, CA). Taq 

1 5 DNA polymerase and PCR nucleotide mixes were obtained from Boehringer 
Mannheim. 

Cloning of bovine and human stem-CD clones of pl,4- 
galactosyltransferase: 

Two sets of oligonucleotide primers with unique restriction sites added at the 5' 

20 (SR) and 3' (CD) ends were synthesized based upon the bovine p4Gal-T 1 and 1 

human p4Gal-Tl cDNA sequence (shown in Figs. 3A and 3B). A full-length 

bovine cDNA clone encoding p4Gal-Tl was used for site-directed mutagenesis 

(Boeggeman et al., Protein Eng.. 6:779-785 (1993)). In the stem region of 

bovine p4Gal-Tl Ser96 was mutated to Ala96 (described below). The mutated 

25 full-length cDNA template was amplified by the polymerase chain reaction 

(PCR), using the following primer pair to generate SRCDP4Gal-Tl: 

(SR) 5'-CGCGGATCCCGCGACCTAAGACGCCTGCCTCAGCTGGTC-3' 
(SEQ ID NO: 17) and 

(CD) 5'-TGGAATTCCTAGCTCGGCGTCCCGATGTCCACTGTGAT-3' (SEQ 
30 ID NO: 18). 

Human placental QUICK-Clone cDNA (Lot # 9020891) was obtained from 
Clontech and used as the template for PCR. To amplify human p4Gal-Tl the 
following PCR primers were used: 
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(SR)5 , -CGCGGATCCCGCGACCTGAGCCGCCTGCCCCAACTGGTC-3 , 
(SEQIDNO: 19) and 

(CD) 5 , -CCGGAA^TCCTACTAGCTCGGTGTCCCGATGTCCACTGT-3 , 
(SEQIDNO: 20). 

5 The primer pairs amplified the cDNA region that has both the stem and catalytic 
domain, and were designed to generate DNA fragments containing a Bam HI 
(SR) and Eco RI (CD) (underlined) at the 5 1 and 3'- ends of the inserts, 
respectively. The PCR products were purified using a QIAquick PCR 
purification kit and the purified fragments were digested with Bam HI and Eco 

10 RI to generate the cohesive ends. The expression constructs were obtained after 
ligating the digested and gel-purified PCR fragments into the Bam HI and Eco 
Rl-digested pET23a vector (Novagen, Madison, WI) and transformed into XL2- 
ultracompetent cells. Identities of the clones were confirmed by DNA sequence. 
Mutation of the stem region of bovine p4Gal-Tl to inhibit 

15 proteolysis: The wild-type bovine SRCDp4Gal-Tl, which has Ser at position 
96, was expressed in E. coll Its sequence is shown in Figure 3 A. The 
recombinant wild-type protein folded and purified from inclusion bodies, as 
described below, was cleaved within the SR region, over a short period of time. 
The cleaved protein on SDS-PAGE shows a single band of molecular mass of 

20 34.9 kDa. The protein was electroblotted to polyvinylidene difluoride 

membrane, and the JV-terminal sequence of the cleaved fragment was determined 
by Edman degradation at the Protein Chemistry Laboratory, NCI-Frederick. The 
iV-terminal sequence of the fragment was determined as SRAPSNLD (SEQ ID 
NO: 21). This corresponds to the cleavage at Ser96 within the sequence region 

25 KPR 96 SRAPSNLD (SEQ ID NO: 22). To prevent this proteolytic digestion 

within the stem region, we now constructed a mutant substituting Ala in place of 
Ser96. This prevented the cleavage within the stem region of bovine [34Gal-Tl 
keeping the recombinant protein intact for a long period of time. Even though, 
the experimental results described in this study are with the mutant bovine 

30 S96A-SRCDp4Gal-Tl (Fig. 3 A), for the sake of convenience in abbreviation 
this protein is simply referred to as bovine SRCDp4Gal-Tl. The human 
SRCDp4Gal-Tl also gets cleaved within the stem region, at Val53 in the 
sequence QLV 53 GVSTPLQ (SEQ ID NO: 23), but only after long periods of 
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time. Since this clone showed more stability than the bovine p4Gal-Tl, it was 
not mutated and used as such for subsequent analysis. 

Protein expression in E. coli and inclusion body isolation: For protein 
expression BL21 (ADE3)/pLysS-competent cells were transformed with the pET 
5 vector derivatives according to the manufacturer's protocols. The transformed 
cells were grown in LB broth containing 50 jig.ml" 1 ampicillin to an OD600 nm 
of - -0.7, followed by induction with 0.4 mM IP TG. Cultures were harvested 
after 3-4 hours by centrifugation at 2000 X g for 20 min. The inclusion bodies 
were isolated and solubilized as described (Boeggeman et al., Protein Eng., 

10 6:779-785 (1993)). From a liter of induced bacterial culture, the yield is 

generally 80 to 100 mg of purified inclusion bodies. Novex gels were used for 
SDS-PAGE analysis and the protein bands were visualized with Coomassie blue. 
Protein concentrations were measured with the Bio-Rad protein dye reagent with 
bovine serum albumin as the standard. 

15 In vitro folding of proteins from inclusion bodies: A protocol utilizing 

oxido-shuffling agents within the renaturation buffer was used to generate active 
recombinant p4Gal-Tl from the inclusion bodies (Boeggeman et al., Protein 
Eng., 6:779-785 (1993)). After renaturation and subsequent dialysis of the 
solution, a portion of the folded protein precipitated in the form of insoluble 

20 aggregates. To increase the yields of native proteins, the inclusion bodies were 
sulfonated as previously described (Boeggeman et al., Glycobiology, 12:395-407 
(2002)). The sulphonated proteins were dissolved in 5 M guanidine-HCl, to a 
protein concentration of 1 mg/ml, with an OD275 nm of 1.9 to 2.0. The protein 
solution was then diluted 10-fold in small aliquots in a renaturation solution to 

25 give a final protein concentration of 100 |ag/ml, in 50 mM Tris-HCl pH 8.0, 5 
mM EDTA, 0.5 M guanidine-HCl, 8 mM cysteamine and 4 mM cystamine. The 
protein was allowed to renature for 48 hrs at 4°C and then dialyzed. The 
precipitate was removed by centrifugation at 5000 rpm and the supernatant 
concentrated using ultrafiltration membranes (Amicon, Inc., Beverly, MA, 

30 USA). 

Improving the folding conditions: In recent years factorial folding 

screens (Rudolph and Lilie, FASEB J.. 10:40-56 (1996); Chen and Gouaux, 

Proc. Natl. Acad. ScL 94:13431-13436 (1997); Armstrong et al, Prot. Sci., 

8:1475-1483 (1999)) have been developed for examining the folding efficiencies 

4S 
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of proteins from inclusion bodies. To improve the in vitro folding efficiency, 8 
different folding conditions similar to the formulations described in the Foldlt 
Screen kit (Hampton Research, CA) with certain modifications were tested. 
Condition I: 50 mM Tris-HCl pH 8.0, 5 mM EDTA, 0.5 M guanidine-HCl, 8 

5 mM cysteamine and 4 mM cystamine. Condition II: 55 Mes pH 6.5, 10.56 mM 
NaCl, 0.44 mM KC1, 2.2 mM MgCl 2 , 2.2 mM CaCl 2 , 0.5 M guanidine-HCl. 
Condition IH: similar to condition II with respect to the buffer, pH, chaotrope 
and salt condition, but it had 0.055% PEG-4000, 1.1 mM EDTA, 0.44 M sucrose 
and 0.55 M L-arginine. Condition TV: 55 mM Mes pH 6.5, 264 mM NaCl, 1 1 

10 mM KC1, 0.055 % PEG-4000, 0.5 M guanidine-HCl, 2.2 mM MgC12, 2.2 mM 
CaC12 and 0.44 M sucrose. Condition V: 55 mM Tris pH 8.2, 10.56 mM NaCl, 
0.44 mM KC1, 1.1 mM EDTA, 0.44 M sucrose. Conditions VI and VIII are 
similar except for the presence of redox agents. Condition VII: 55 mM Mes pH 
6.5, 264 mM NaCl, 1 1 mM KC1, 1.1 mM EDTA, 0.5 M guanidine-HCl, and 0.55 

15 M L-arginine. The buffers H through VH had 100 mM GSH and 10 mM GSSG. 
Conditions I and VDI, had 8 mM cysteamine and 4 mM cystamine. Condition 
Vm, gave the highest enzymatic activity, soluble and folded protein, was 50 mM 
Tris-HCl pH 8.0, 10.56 mM NaCl, 0.44 mM KC1, 2.2 mM MgC12, 2.2 mM 
CaC12 0.5 M guanidine-HCl, 8 mM cysteamine and 4 mM cystamine, 0.055% 

20 PEG-4000 and 0.55 M L-arginine. 

p4Gal-Tl Enzyme Assays: The in vitro assay for p4Gal-Tl enzyme 
activity was performed as described (Boeggeman et al., Glvcobiology, 12:395- 
407 (2002)). The 3 H-labeled-UDP-Gal was used as sugar donor and GlcNAc as 
the sugar acceptor. A reaction without GlcNAc was used as a control. The 

25 fusion SRCDp4Gal-Tl proteins showed the same enzymatic activities when 
compared to the wild-type recombinant P4Gal-Tl (amount of product 
formation/min/mol of protein). 

Binding of recombinant proteins to UDP-affmity column: UDP- 
agarose was used to bind the active proteins at 4°C. Columns containing 1 ml 

30 bed volume of UDP-agarose were pre-equilibrated with 10 ml of 25 mM 
cacodylic buffer, pH 7.6, containing 25 mM MnC12. The renatured protein 
solutions (0.5 mg) were adjusted to the pre-equilibration buffer conditions and 
applied to the columns. After loading the sample, the pass-through was re- 
cycled. It was then analyzed for the unbound protein by SDS-P AGE. The 



WO 2004/063344 



PCT/US2004/000470 



columns were washed 3X with 1-ml portions of equilibration buffer. The bound 
protein was eluted with the elution buffer consisting of 25 mM cacodylic buffer, 
pH 7.6, 25 mM EDTA and 1M NaCl. Fractions of 0.5 ml were collected from 
the columns, and aliquots were analysed by SDS-PAGE. For enzymatic analysis 
5 the appropriate fractions were pooled and dialysed against 50 mM Tris-HCl pH 
8.0at4°C. 

Circular dichroism-spectroscopy: Far ultraviolet circular dichroism 
spectra of recombinant proteins was measured in a JASCO J-715 
spectropolarimeter. The spectra for each sample was scanned at a speed of 100 

1 0 nm/min. Far ultraviolet circular dichroism spectra were obtained from 1 90 to 
250 nm at room temperature using a pathlength of 0.1 mm with a 1 .0 nm 
bandwidth. Protein was dissolved in Tris-HCl pH 8.0 at concentrations of 0.5 
mg/ml. Buffer blanks obtained in the same cuvette were subtracted to generate 
net spectra. Raw ellipticity data were transformed into molar residue ellipticity 

1 5 using the concentration values determined from optical ultraviolet absorbance 
measurements. No smoothing or data averaging was applied. 

Results 

Influence of the stem reg ion (SR) cova lently linke d to the iV-ter mini of thp 
20 catalytic domain (CD) of t he B4Gal-T family members on the in vitro folding 
efficiency of the reco mbinant proteins from inclusion bodies 
The catalytic domain of p4Gal-Tl (amino acid residues 130-402) 
(sequence in Figure 3A, SEQ ID NO: 9) accumulated as an insoluble material 
within the inclusion body fraction upon expression in E. coli. Essentially all the 
25 proteins with the stem regions covalently linked to the A^-termini of the catalytic 
domain (sequence in Figure 3A, SEQ ID NO: 9) also accumulated in the 
insoluble inclusion body fraction. Contaminating proteins from the inclusion 
bodies were removed by washing the insoluble pellet several times with a buffer 
containing 25% sucrose as described herein. The yield of the purified inclusion 
30 bodies, varied with the protein that is being expressed (Table VH). The proteins 
produced a single protein band of expected molecular size on SDS-PAGE gel 
(Figure 4). These insoluble inclusion bodies required 5 M guanidine HC1 for 
solubilization. 
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Sulphonation of the inclusion bodies of CDp4Gal-Tl prior to the in vitro 
folding step gave higher yields of final folded proteins. The yield of sulphonated 
protein from the purified inclusion bodies was generally half of the starting 
protein material (Table VH). Oxido-shuffling reagents were present during the 
5 renaturation step to regenerate the enzyme activity from all the recombinant 
sulphonated proteins. The recombinant proteins having the stem region fused at 
the JV-termini of the catalytic domain (SRCD) remained soluble during the 
renaturation step in 0.5 M guanidine HC1 and the oxido-shuffling agents. Upon 
subsequent dialysis of the SRCD-protein solutions, to remove the guanidine 

10 HC1, both the folded and misfolded molecules remained soluble as judged by 
SDS-PAGE analysis. In contrast, a majority of the misfolded protein molecules 
aggregated and precipitated in the protein solution containing only the CD. The 
recovery of soluble human and bovine SRCDp4Gal-Tl is 3- to 9-fold higher 
than proteins having only the CD domain following refolding in the absence of 

1 5 PEG-4000 and L-arginine. (Table VIE). 
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Table Vn 

Yield of inclusion bodies and sulphonated proteins 

Recombinant Inclusion Amount of sulphonated 

5 Proteins body(mg)* protein recovered from 

inclusion bodies (me)" 1 " 

Bovine CDp4GalT-l 120 65 

Bovine SRCDp4GalT-l 1 13 50 

10 Human CDp4GalT-l 80 37 

Human SRCDp4GalT-l 90 50 

*Amount of inclusion bodies obtained from the pellet of one liter of IPTG 
induced bacterial culture. 
15 + Amount of sulphonated protein was determined by SDS-P AGE and by 
protein estimation using the Bradford method (Bio Rad). 



20 Table VIII 

In vitro renaturation of the sulphonated inclusion bodies 

- PEG/L- Arg + +PEG/L-Arg + 

CDp4Gal-Tl SRCDp4Gal-Tl CDp4Gal-Tl SRCDp4Gal-Tl 

25 mg (%)" mg (%)" mg (%)" mg (%)" 

Bovine 2.7 (14) 9.4 (47) 8.3 (42) 18.0 (90) 



Human 2.1 (11) 19.0 (95) 14.0 (69) 20.0 (100) 

30 + 20 mg of sulphonated proteins were folded in the buffer previously described 
that does not contain PEG-4000 and L-arginine (condition I) (Boeggeman, et 
al.). The proteins after the renaturation step were dialyzed, centrifuged and their 
concentration determined by SDS-P AGE and the Bradford method (BioRad) 

35 'Percentage recovery of the soluble protein after in vitro folding of 20 mg of 
sulphonated 
p4Gal-Tl 

A The soluble renatured CDp4Gal-Tl protein contains almost all folded 
40 molecules, whereas the soluble renatured SRCDp4Gal-Tl contains misfolded 
proteins that are removed on UDP-agarose columns (see Table DC) 

Renaturation conditions that increase the folding efficiency and 
specific activity of P4GaI-Tl: After folding the bovine wild-type CDp4Gal-Tl 
45 in a renaturation buffer I (described herein), the misfolded proteins precipitated 
out during subsequent dialysis. In an attempt to increase the amount of the 
folded native protein during the renaturation step, various folding conditions 

•VI 



WO 2004/063344 



PCTYUS2004/000470 



consisting of seven different solutions were tested. The results were compared 
to the yields obtained according to condition I (Figure 5). The tested inclusion 
bodies were dissolved in 5 M guanidine HC1 to a final protein concentration of 1 
mg/ml. Aliquots of 50 |il of the protein solution were added to 950 |xl of the 
5 folding buffers in an eppendorf tube, mixed gently and incubated at 4°C for 

times ranging from 4 hours to overnight. The folding buffers varied, (conditions 
I to VIE as disclosed herein) with respect to: pH from 6.5 (in conditions II, HI, 
IV and VII), to 8.0 (in conditions V, VI, and VIII); cations (in conditions II, IV, 
VI and Vm) and chelators (conditions I, HI, V and VII). Conditions VI, VII and 

10 VIE had polar additives (L-arginine). Conditions IV and V had non-polar 

additives, such as sucrose. Condition III had both sucrose and L-arginine. Some 
buffers also had PEG-4000 (conditions III, IV, VI and VIII). The appropriate 
redox environment was created by the presence of 100 raM reduced (GSH) and 
10 mM oxidized (GSSH) glutathione, except under conditions I and Vm which 

15 contained cysteamine (8 mM) and cystamine (4 mM). 

After overnight incubation and dialysis against ImM Tris-HCl pH 8.0, 
samples were centrifuged and 5 \il of supernatant from each sample was tested 
for galactosyltransferase activity and compared with each other (Figure 5). The 
results showed the highest activity under the buffer condition VI and Vm. Both 

20 of these had 0.55 M L-arginine and 0.055% PEG-4000 (Figure 5). All folding 
conditions were tested in the presence of 30 mM lauryl maltoside, which appears 
to have a negative effect on the folding yields of the recombinant proteins, even 
in the presence of 0.55 M L-arginine and 0.055% PEG-4000. The inclusion of 
sucrose had an inhibitory effect (condition IV and V). No preference was 

25 observed for pH, salts, or cations within the conditions tested. The redox 

environment created by 8 mM cysteamine: 4 mM cystamine, (condition VIII), 
was preferred over 100 mM GSH:10 mM GSSH (condition VI). Enzymatic 
assays (Figure 5) revealed that the combination of L-arginine and PEG-4000 in 
condition Vm, increased the in vitro folding efficiency of the CD and SRCD 

30 proteins by about 7- and 3-fold, respectively, compared to condition I which did 
not contain PEG-4000 and L-arginine. 

Misfolded proteins in the renatured SRCDp4Gal-Tl: Renatured 
proteins were analyzed on SDS-PAGE gels under native and reducing conditions 
to determine whether the renatured SRCDp4Gal-Tl proteins were natively 
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folded or misfolded. Analyses under native conditions, in the absence of 0- 
mercaptoethanol (PME) and without boiling the samples, revealed that both 
bovine and human soluble SRCDp4Gal-Tl proteins contained misfolded 
molecules that aggregated and remained at the top of the gel (Figure 6 (-)). 
5 Under reducing conditions, using pME in the boiled samples, the misfolded 
molecules of both bovine and human soluble SRCDp4Gal-Tl proteins did not 
appear on the top of the wells in SDS-PAGE and produced single protein bands 
of the expected sizes (35 kDa for bovine species and 34 kDa for human species) 
(Figure 6 (+)). 

10 Binding of recombinant proteins to affinity columns: Folded and 

active CDp4Gal-Tl obtained from inclusion bodies bound to, and was eluted 
from, a UDP-column with 25 mM EDTA and 1M NaCL Thus, binding of 
soluble renatured CD and SRCDp4Gal-Tl proteins to UDP-agarose columns 
was tested. Native SDS gel analyses of the soluble renatured CDp4Gal-Tl 

15 protein, without pME and boiling, showed that all of the soluble renatured CD 
protein was natively folded. The renatured SRCDp4Gal-Tl proteins were then 
applied to a UDP-agarose column. Nearly all of the renatured SRCDp4Gal-Tl 
proteins bound to, and eluted from, the UDP-agarose column (Figure 7). The 
specific enzymatic activity of the SRCDp4Gal-Tl protein (product 

20 formation/min/ng protein) before and after being passed through the UDP- 
agarose column was measured to determine if the soluble protein fraction of the 
renatured SRCDp4Gal-Tl protein contained properly folded molecules (Figure 
8). Also, the samples were analyzed in the absence of PME and without boiling. 
When condition I was used to fold bovine- and human-SRCD proteins (Figure 

25 8 A), the folded molecules that eluted with 25 mM EDTA and 1 M NaCl, 

(Figure 8 A (+)) had 2- and 6-fold higher specific activities (Figure 8A, crossed- 
hatched bars), respectively, when compared to the protein before loading onto 
the affinity column (Figure 8A, black bars). The renatured human SRCDP4Gal- 
Tl sample showed more misfolded proteins compared to bovine SRCDp4Gal- 

30 Tl (Figure 8A, bovine versus human black bars). In contrast, folding according 
to condition VIE, that contains PEG-4000 and L-arginine (Figure 8B), increased 
properly folded human SRCD compared to condition I (Figure 8B vs Figure 8A, 
black bars, human SRCD). Protein eluted from the UDP-column under 
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condition VIE (Figure 8B, cross bar) had ~ 2-fold higher specific activity as 
compared to the proteins before being loaded on the affinity column (Figure 8B, 
black bars). Although soluble, a portion of the protein in the SRCD samples is 
not folded to enable binding to UDP-agarose. The increased specific activity of 

5 the eluted proteins indicated the amount of misfolded molecules present in the 
sample before being bound to the UDP-agarose column. These results agreed 
with the observation that the misfolded proteins in the samples before being 
passed through the UDP-agarose columns also remained at the top of the wells in 
the SDS-PAGE gel (Figures 7B and C, lane U). The soluble bovine and human 

1 0 CDp4Gal-Tl behaved the same way regardless of the buffer used for folding 
(condition I or VIII) (Table DC). The specific activity of bovine and human 
CDp4Gal-Tl before and after binding on UDP-agarose columns did not change. 
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Circular dichroism-spectra of recombinant p4Gal-Tl : Circular 
dichroism experiments were done with purified p4Gal-Tl proteins to determine 
if the addition of the stem disturbed the overall secondary structure of p4Gal-Tl. 
5 No significant differences in secondary structure relative to the wild-type 
CDp4Gal-Tl were detected for the bovine or human proteins. All systems 
showed extrema of negative ellipticity between 250 and 200 nm. The circular 
dichroism-spectrum of the wild-type enzyme was highly reproducible and 
showed negative extrema at 208 and 220 nm indicative of a-helicity. There was 
10 a small change in ellipticity observed in the 200-210 nm region upon the 
addition of the stem in both bovine and human SRCD proteins. 

All publications, patents and patent applications are incorporated herein 
by reference. While in the foregoing specification this invention has been 
15 described in relation to certain preferred embodiments thereof, and many details 
have been set forth for purposes of illustration, it will be apparent to those skilled 
in the art that the invention is susceptible to additional embodiments and that 
certain of the details described herein may be varied considerably without 
departing from the basic principles of the invention. 

20 
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What is claimed is: 

1 . A purified and isolated catalytic domain from a 0(1 ,4)-galactosyltransferase I 
that catalyzes formation of a glucose-/?(l,4)-N-acetylglucosamine bond at a 

5 greater rate than wild-type 0(1 ,4)-galactosyltransferase I. 

2. The catalytic domain of claim 1, wherein the rate of formation of the 
glucose-0(l,4)-N-acetylglucosamine bond is at least two-fold, five-fold, or ten- 
fold greater than wild-type /3(l,4)-galactosyltransferase L 

10 

3 . The catalytic domain of claim 1 , wherein 

(a) the catalytic domain has a conservative amino acid exchange at amino 
acid position 228, 

(b) the catalytic domain has a conservative amino acid exchange at amino 
15 acid position 229, 

(c) the catalytic domain has a conservative amino acid exchanges at 
amino acid positions 228 and 229, 

(d) lysine is exchanged for arginine at amino acid position 228, or 

(e) lysine is exchanged for arginine at amino acid position 228 and 
20 glycine is exchanged for alanine at amino acid position 229 

4. A polypeptide comprising the catalytic domain of claim 1 . 

5 . A purified and isolated catalytic domain from a 0(1 ,4)-galactosyltransferase I 
25 that catalyzes formation of an N-acetylgalactosamine-j3(l ,4)-N- 

acetylglucosamine bond. 

6. The catalytic domain of claim 5, wherein 

(a) the catalytic domain has a conservative amino acid exchange at amino 
30 acid position 289; 

(b) the catalytic domain has a tyrosine at amino acid position 289 
exchanged with a leucine, isoleucine, or asparagine; 

(c) the catalytic domain has a cysteine at amino acid position 342 is 
exchanged by a threonine, 

SS 
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(d) any combination of (a) and (c), or 

(e) any combination of (b) and (c). 

7. A polypeptide comprising the catalytic domain of claim 5 . 

5 

8. A purified and isolated catalytic domain from a j3(l,4)-galactosyltransferase I 
that catalyzes formation of an N-acetylgalactosamine-^l^^-glucose bond in the 
presence of a-lactalbumin. 

10 9. The catalytic domain of claim 8, wherein 

(a) the catalytic domain has a conservative amino acid exchange at amino 
acid position 289, 

(b) tyrosine is exchanged with leucine, isoleucine, or asparagine at amino 
acid position 289, 

15 (c) a cysteine at amino acid position 342 is exchanged with a threonine, 

(d) any combination of (a) and (c), or 

(e) any combination of (b) and (c). 

10. A polypeptide comprising the catalytic domain of claim 8. 

20 

11. A purified and isolated catalytic domain from a /3(1 ,4)- 
galactosyltransferase I that catalyzes formation of an N-acetylglucosamine- 
j3(l,4)-N-acetylglucosamine bond. 

25 12. The catalytic domain of claim 1 1 , wherein 

(a) the catalytic domain has a conservative amino acid exchange at amino 
acid position 228, 

(b) the catalytic domain has a conservative amino acid exchange at amino 
acid position 289, 

30 (c) the catalytic domain has conservative amino acid exchanges at amino 

acid positions 228 and 289, or 

(d) lysine is exchanged for arginine at amino acid position 228, and 
leucine is exchanged for tyrosine at amino acid position 289. 

5g> 
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13. A polypeptide comprising the catalytic domain of claim 1 1 . 

14. A purified and isolated catalytic domain from a jS(l,4)-galactosyltransferase 
I that catalyzes formation of a mannose-/3(l,4)-N-acetylglucosamine bond. 

5 

15. The catalytic domain of claim 14, wherein 

(a) the catalytic domain has a conservative amino acid substitution at 
amino acid position 228, 

(b) the catalytic domain has a conservative amino acid substitution at 
1 0 amino acid position 289, 

(c) the catalytic domain has conservative amino acid substitutions at 
amino acid positions 228 and 289, or 

(d) lysine is exchanged for arginine at amino acid position 228, and 
leucine is exchanged for tyrosine at amino acid position 289. 

15 

16. A polypeptide comprising the catalytic domain of claim 14. 

17. A purified and isolated catalytic domain from a /?(l,4)-galactosyltransferase 
I that catalyzes formation of a galactose-/3(l,4)-N-acetylglucosamine-6-S03 

20 bond. 

18. The catalytic domain of claim 17, wherein 

(a) the catalytic domain has conservative amino acid exchanges at amino 
acid positions 279 and 280, or 
25 (b) serine is exchanged for lysine at amino acid position 279, and 

threonine is exchanged for phenylalanine at amino acid position 280. 

19. A polypeptide comprising the catalytic domain of claim 1 7. 



30 20. A purified and isolated polypeptide that promotes in vitro folding of a 
catalytic domain of /3(l,4)-galactosyltransferase L 



51 
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21 . The polypeptide of claim 20, wherein the amino acid segment comprises an 
amino acid sequence described by SEQ ID NO: 6 or 7; or has an amino acid 
sequence described by SEQ ID NO: 6 or 7. 

5 22. A nucleic acid segment encoding a polypeptide according to any one of 
claims 1,5, 8,11, 14, or 17. 

23. An expression cassette comprising the nucleic acid segment of claim 22. 

10 24. A cell comprising the nucleic acid segment of claim 22, or the expression 
cassette of claim 23. 



25. A method to synthesize a glucose-|8(l,4)-N-acetylglucosamine moiety 
comprising: 

15 incubating a reaction mixture comprising the jS(l,4)-galactosyltransferase 

I according to claim 1, with a donor comprising glucose, and an acceptor 
comprising N-acetylglucosamine. 

26. The method of claim 25, wherein the donor is UDP-glucose, the acceptor is 
20 N-acetylglucosamine, or the donor is UDP-glucose and the acceptor is N- 

acetylglucosamine. 

27. An oligosaccharide comprising a glucose-j3(l ,4)-N-acetylglucosamine 
moiety synthesized according to the method of claim 25. 

25 

28. A method to synthesize an N-acetylgalactosamine-j3(l ,4)-N- 
acetylglucosamine moiety comprising: 

incubating a reaction mixture comprising the /3(l,4)-galactosyltransferase 
I according to claim 5, with a donor comprising N-acetylgalactosamine, and an 
30 acceptor comprising N-acetylglucosamine. 

29. The method of claim 28, wherein the donor is UDP-N-acetylgalactosamine, 
the acceptor is N-acetylglucosamine, or the donor is UDP-N- 
acetylgalactosamine and the acceptor is N-acetylglucosamine. 
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30. An oligosaccharide comprising an N-acetylgalactosamine-j3(l ,4)-N- 
acetylglucosamine moiety synthesized according to the method of claim 28. 

5 3 1 . A method to synthesize an N-acetylgalactosamin^j8(l,4)-glucose moiety 
comprising: 

incubating a reaction mixture comprising the /?(l,4)-galactosyltransferase I 
according to claim 8, c^lactalbumin,' a donor comprising N-acetylgalactosamine, 
and an acceptor comprising glucose. 

10 

32. The method of claim 31, wherein the donor is UDP-N-acetylgalactosamine, 
the acceptor is glucose, or the donor is UDP-N-acetylgalactosamine and the 
acceptor is glucose. 

15 33. An oligosaccharide comprising an N-acetylgalactosamine-|S(l,4)-glucose 
moiety synthesized according to the method of claim 31. 

34. A method to synthesize an N-acetylglucosamine-j3(l,4)-N- 
acetylglucosamine moiety comprising: 

20 incubating a reaction mixture comprising a j8(l,4)-galactosyltransferase I 

according to claim 11, with a donor comprising N-acetylglucosamine, and an 
acceptor comprising N-acetylglucosamine. 

35 . The method of claim 34, wherein the donor is UDP-N-acetylglucosamine, 
25 the acceptor is N-acetylglucosamine, or the donor is UDP-N-acetylglucosamine 

and the acceptor is N-acetylglucosamine. 

36. An oligosaccharide comprising an N-acetylglucosamine-j8(l,4)-N« 
acetylglucosamine moiety synthesized according to the method of claim 34. 

30 

37. A method to synthesize a mannose-jS(l,4)-N-acetylglucosamine moiety 
comprising: 
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incubating a reaction mixture comprising the |3(l,4)-galactosyltransferase I 
according to claim 14, with a donor comprising mannose, and an acceptor 
comprising N-acetylglucosamine. 

5 38. The method of claim 37, wherein the donor is UDP-mannose, the acceptor 
is N-acetylglucosamine, or the donor is UDP-mannose and the acceptor is N- 
acetylglucosamine. 

39. An oligosaccharide comprising a mannose-]3(l,4)-N-acetylglucosamine 
1 0 moiety synthesized according to the method of claim 37. 

40. A method to synthesize a galactose-j3(l ,4)-N-acetylglucosamine-6-S0 3 
moiety comprising: 

incubating a reaction mixture comprising the j8(l,4)-galactosyltransferase 
15 I according to claim 17, with a donor comprising galactose, and an acceptor 
comprising N-acetylglucosamine-6-S03. 

41. The method of claim 40, wherein the donor is UDP-galactose, the acceptor 
is N-acetylglucosamine-6-S03, or the donor is UDP-galactose and the acceptor is 

20 N-acetylglucosamine-6-S03- 

42. An oligosaccharide comprising a galactose-]3(l ,4)-N-acetylglucosamine-6- 
SO3 moiety synthesized according to the method of claim 40. 

25 43. A method comprising: 

incubating a reaction mixture comprising an antigen having an acceptor, 
a donor, and the |3(l,4)-galactosyltransferase I according to any one of claims 1, 
5, 8, 11, 14, or 17 under conditions wherein the j3(l,4)-galactosyltransferase I 
catalyzes bond formation between the donor and the acceptor on the antigen and 

30 causes an increase in the immunogenicity of the antigen. 

44. The method of claim 43, wherein the donor is selected from the group 
consisting of UDP-galactose, UDP-mannose, UDP-N-acetylglucosamine, UDP- 
glucose, GDP-mannose, and UDP-N-acetylgalactosamine. 

C^6 
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45 . The method of claim 43, wherein the acceptor is a carbohydrate, a 
glycoprotein, or a glycolipid. 

5 46. The method of claim 45, wherein the carbohydrate is selected from the 
group consisting of a monosaccharide, a disaccharide, an oligosaccharide, and a 
polysaccharide. 

47. The method of claim 43, wherein the antigen is a vaccine. 

10 

48. The method of claim 43, wherein the antigen is a protein or a glycoprotein. 

49. An antigen prepared according to the method of claim 43. 

15 50. A method to prepare a saccharide composition having a defined sequence 
comprising: 

contacting an acceptor with a first donor in the presence of a first 
glycosyltransferase to catalyze linkage of the acceptor with the donor to form a 
first saccharide composition; and 
20 contacting the first saccharide composition with a second donor in the 

presence of a second glycosyltransferase to catalyze linkage of the first 
saccharide composition with the second donor to form a second saccharide 
composition, 

wherein at least one of the first glycosyltransferase or the second 
25 glycosyltransferase is a ]8(l,4)-galactosyltransferase I according to any one of 
claims 1, 5, 8, 1 1, 14, or 17, and the other first or second glycosyltransferase is 
selected from the group consisting of a galactosyltransferase, a sialyltransferase, 
a fucosyltransferase, an N-acetylgalactosaminyltransferase, an N- 
acetylglucosaminyltransferase, and a glucuronyltransferase. 

30 

5 1 . The method of claim 5 0, wherein the first donor or the second donor is 
selected from the group consisting of UDP-galactose, UDP-mannose, UDP-N- 
acetylglucosamine, UDP-glucose, GDP-mannose, UDP-N-acetylgalactosamine, 
UDP-glucuronic acid, GDP-Fucose, and CMP-N-acetylneuraminic acid. 
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52. The method of claim 50, wherein the acceptor is a carbohydrate, a 
glycoprotein, or a glycolipid. 



5 53. The method of claim 52, wherein the carbohydrate is selected from the 
group consisting of a monosaccharide, a disaccharide, an oligosaccharide, and a 
polysaccharide. 

54. The method of claim 50, wherein the acceptor is an antigen. 

10 

55. The method of claim 54, wherein the antigen is a vaccine. 



56. The method of claim 54, wherein the antigen is a protein or a glycoprotein. 



15 57. A composition prepared according to the method of claim 50. 

58. A kit comprising packaging material, and a polypeptide comprising the 
catalytic domain of any one of claims 1, 5, 8, 11, 14, or 17. 

20 59. The kit of claim 58, further comprising a donor. 

60. The kit of claim 59, wherein the donor is selected from the group consisting 
of UDP-galactose, UDP-mannose, UDP-N-acetylglucosamine, UDP-glucose, 
GDP-mannose, UDP-N-acetylgalactosamine, UDP-glucuronic acid, GDP- 
25 Fucose, and CMP-N-acetylneuraminic acid. 
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gllllli S. L T & C~ P K B S P L L V * P HL I K P V I P V D L 

KLVBO QKPKVKZ. GGP.YTPHD C X S P H K V 
GaAGCTTGTGGRG<aGCA(^CCCGA&GGT<^G^ 

A X X X P £ JtSr'&QSKXiKYWLYYLHPXL Q £ Q Q 
CCMCATaEXTCC3OTCCGCAACCGGCAGGA^^ 

LDYGIYVIKQAGBSHEJffcAKLLffV G H K B 
TTAGACTAX GGCAT CTAT GTTAT CAAC CAGGCT GG&GAGT C CAT GTT CAAC C GG GC&AAGCT C CT C&AT GTTGGCTTTRAAGA 

ALKDYD YWCEVB SBVDLXPH3JDHHTYE. 
GGC CTT G&AGGACTAT G&CTACRACT GCTTT GT GTTTAGC GAT GTGGACGT CAT C C CMJ G&AG GACCAXAACAC CTAGAGGT 

CBS QPHHXSVAKDKBGHSI.PYVQYEGGV 
GCTTTT CACAGC CAC GG CACATTT CT GTAGCAAT GGRTAAGTTT GGATTTAGC CTAC CTTAG GT GCAGTATTT T GGAGGT GT C 

SAL S X Q Q B L SXlTGFPir*YWGVGGBDDDX 
T CT GCT CTAAGTABACAACAGTTT CT GAG CAT C&AT GGATTT G CTAATAACTACT GGGGCT GGGGAGGT G&AOT GAT GACAT 

YJTRLAHJL GHSVSfcPHAVX GKCXtHXXHS 
TTAT&ACAGfiTTAGCTTTTAGAGGGAT GT CTGT GT CT C GCC CARAT GCTGT OT C GGGRAGT GT CGGAT GRXC C GG CACT G GA 

E.DKKIT. EPJFP QUPBItlAHTKETML SD GLH 
GkCACAAGAAASAT GRAC CTAAT C CT CAGAGGTTT GACCG&ATT GCACATACSAAGGAGACAAr GCT CT CT GAT GGTTT GRAC 

SLTYKVLSVQS.YPXtYTKXTVSX G T P S 
T PA CT CAECTA EAT EST GTTAGRGGT C CAGAG GTAC C C GTT GTAIAG Vblg&StCRC&GMGZC&ECGGGZLCGCCGRGC 



Fig. 3A 
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I ^? <r I "5? V I K Q A G D T I FVSt&K^X^VffC r n t« a 
MCTarGGCMCTOTCTTOTCAACCAGCCGGGiGACACra 

L X D Y ? T C rVTSDVDL IPHHDHMAVCr 

CTTCAA&&ACTAi&ACTACACcT«rrrr&TGT^ 

£ 3 • Q P R H I SViHDKrcrSLPTfVO-S rC GUS 
TTTCACAGCCACGGCACATTTCCGlTeCAATWATA&GT^ 

oiLJL.iL.?.,/ 1 Qtlt xvGrpxrxrYVGVGGEDDD ir 

GCT CT A AGT AAAC AAC AGTTT CT AAC C AT C AAT GGATTT C CT & AT AATT ATTGGGGCT GGGGAGG AGAA&ftT GAT &AC ATTTT 

NRLVTRGMS I 3RPHAVVGRCREI IB.HSS 
TMCAGATTAGTTTWA&AGGCATGTCTATATC^ 

DKKNEPWP Q RTOR I AHTKE T UX 3D G L H S 
ACAAAAAAAAT&AACCCAATCCTCACA&GTTT&ACCG-A&TTGCACACACAAAGGAGACAATGCTCTCTGAT&GTTTGAACTCA 

LTTfQV LDUQR-ypLVTQ XTVS I & T P 3 
CTCACCTACCA&GTGCTGGATGTACA&AGATACCCATTGTATACCCAAATCa.Ca^gCTCagC<?gGBCGCCgacg 
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SEQUENCE LISTING 

<110> National Institutes of Health 
5 Qasba, Pradman 

Ramakr i shnan , Boopa thy 

<120> Catalytic Domains of 3 (1/ 4 ) -Galactosyl transferase I Having Altered 
Donor and Acceptor Specificities, Domains That Promote In Vitro 
10 Protein Folding, and Methods for Their Use 

<130> 1662.001WO1 

<150> US 60/439,298 
15<151> 2003-01-10 

<150> US 60/450,250 
<151> 2003-02-25 

20<160> 23 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
25<211> 7 
<212> PRT 
<213> Homo sapiens 

<400> 1 

30Phe Asn Arg Ala Lys Leu Leu 
1 5 

<210> 2 
<211> 7 
35<212> PRT 

<213> Homo sapiens 



<400> 2 

Tyr Val Gin Tyr Phe Gly Gly 
40 1 5 
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<210> 3 
<211> 398 
<212> PRT 

<213> Homo sapiens 

5 

<400> 3 

Met Arg Leu Arg Glu Pro Leu Leu Ser Arg Ser Ala Ala Met Pro Gly 

15 10 15 

Ala Ser Leu Gin Arg Ala Cys Arg Leu Leu Val Ala Val Cys Ala Leu 
10 20 25 30 

His Leu Gly Val Thr Leu Val Tyr Tyr Leu Ala Gly Arg Asp Leu Ser 

35 40 45 

Arg Leu Pro Gin Leu Val Gly Val Ser Thr Pro Leu Gin Gly Gly Ser 
50 55 60 

15Asn Ser Ala Ala Ala He Gly Gin Ser Ser Gly Asp Leu Arg Thr Gly 
65 70 75 80 

Gly Ala Arg Pro Pro Pro Pro Leu Gly Ala Ser Ser Gin Pro Arg Pro 

85 90 95 

Gly Gly Asp Ser Ser Pro Val Val Asp Ser Gly Pro Gly Pro Ala Ser 
20 100 105 no 

Asn Leu Thr Ser Val Pro Val Pro His Thr Thr Ala Leu Ser Leu Pro 

115 120 125 

Ala Cys Pro Glu Glu Ser Pro Leu Leu Val Gly Pro Met Leu He Glu 
130 135 140 

25Phe Asn Met Pro Val Asp Leu Glu Leu Val Ala Lys Gin Asn Pro Asn 
145 150 155 160 

Val Lys Met Gly Gly Arg Tyr Ala Pro Arg Asp Cys Val Ser Pro His 

165 170 175 

Lys Val Ala He He He Pro Phe Arg Asn Arg Gin Glu His Leu Lys 
30 180 185 190 

Tyr Trp Leu Tyr Tyr Leu His Pro Val Leu Gin Arg Gin Gin Leu Asp 

195 200 205 

Tyr Gly He Tyr Val He Asn Gin Ala Gly Asp Thr He Phe Asn Arg 
210 215 220 

35Ala Lys Leu Leu Asn Val Gly Phe Gin Glu Ala Leu Lys Asp Tyr Asp 
225 230 235 240 

Tyr Thr Cys Phe Val Phe Ser Asp Val Asp Leu He Pro Met Asn Asp 

245 250 255 

His Asn Ala Tyr Arg Cys Phe Ser Gin Pro Arg His He Ser Val Ala 
40 260 265 270 

Met Asp Lys Phe Gly Phe Ser Leu Pro Tyr Val Gin Tyr Phe Gly Gly 
275 280 285 
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Val Ser Ala Ser Ser Lys Gin Gin Phe Leu Thr He Asn Gly Phe Pro 

290 295 300 

Asn Asn Tyr Trp Gly Trp Gly Gly Glu Asp Asp Asp He Phe Asn Arg 
305 310 315 320 

5Leu Val Phe Arg Gly Met Ser He Ser Arg Pro Asn Ala Val Val Gly 
325 330 335 

Thr Cys Arg Met He Arg His Ser Arg Asp Lys Lys Asn Glu Pro Asn 

340 345 350 

Pro Gin Arg Phe Asp Arg He Ala His Thr Lys Glu Thr Met Leu Ser 
10 355 360 365 

Asp Gly Leu- Asn Ser Leu Thr Tyr Gin Val Leu Asp Val Gin Arg Tyr 

370 375 380 

Pro Leu Tyr Thr Gin He Thr Val Asp He Gly Thr Pro Ser 
385 390 395 

15 

<210> 4 
<211> 399 
<212> PRT 

<213> Mus musculus 

20 

<400> 4 

Met Arg Phe Arg Glu Gin Phe Leu Gly Gly Ser Ala Ala Met Pro Gly 

1 5 io i 5 

Ala Thr Leu Gin Arg Ala Cys Arg Leu Leu Val Ala Val Cys Ala Leu 
25 20 25 30 

His Leu Gly Val Thr Leu Val Tyr Tyr Leu Ser Gly Arg Asp Leu Ser 

35 40 45 

Arg Leu Pro Gin Leu Val Gly Val Ser Ser Thr Leu Gin Gly Gly Thr 
50 55 60 

30Asn Gly Ala Ala Ala Ser Lys Gin Pro Pro Gly Glu Gin Arg Pro Arg 
65 70 75 80 

Gly Ala Arg Pro Pro Pro Pro Leu Gly Val Ser Pro Lys Pro Arg Pro 

85 90 95 

Gly Leu Asp Ser Ser Pro Gly Ala Ala Ser Gly Pro Gly Leu Lys Ser 
35 100 105 no 

Asn Leu Ser Ser Leu Pro Val Pro Thr Thr Thr Gly Leu Leu Ser Leu 

115 120 125 

Pro Ala Cys Pro Glu Glu Ser Pro Leu Leu Val Gly Pro Met Leu He 
130 135 140 

40 Asp Phe Asn He Ala Val Asp Leu Glu Leu Leu Ala Lys Lys Asn Pro 
145 150 155 i5o 
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Glu He Lys Thr Gly Gly Arg Tyr Ser Pro Lys Asp Cys Val Ser Pro 

165 170 175 

His Lys Val Ala He He He Pro Phe Arg Asn Arg Gin Glu His Leu 
180 185 190 

5Lys Tyr Trp Leu Tyr Tyr Leu His Pro He Leu Gin Arg Gin Gin Leu 
195 200 205 

Asp Tyr Gly He Tyr Val He Asn Gin Ala Gly Asp Thr Met Phe Asn 

210 215 220 

Arg Ala Lys Leu Leu Asn He Gly Phe Gin Glu Ala Leu Lys Asp Tyr 
1022 5 230 235 240 

Asp Tyr Asn Cys Phe Val Phe Ser Asp Val Asp Leu lie Pro Met Asp 

245 250 255 

Asp Arg Asn Ala Tyr Arg Cys Phe Ser Gin Pro Arg His He Ser Val 
260 265 270 

15Ala Met Asp Lys Phe Gly Phe Ser Leu Pro Tyr Val Gin Tyr Phe Gly 
275 280 285 

Gly Val Ser Ala Leu Ser Lys Gin Gin Phe Leu Ala lie Asn Gly Phe 

290 295 300 

Pro Asn Asn Tyr Trp Gly Trp Gly Gly Glu Asp Asp Asp He Phe Asn 
20305 310 315 320 

Arg Leu Val His Lys Gly Met Ser He Ser Arg Pro Asn Ala Val Val 

325 330 335 

Gly Arg Cys Arg Met He Arg His Ser Arg Asp Lys Lys Asn Glu Pro 
340 • 345 350 

2 5 Asn Pro Gin Arg Phe Asp Arg He Ala His Thr Lys Glu Thr Met Arg 
355 360 365 

Phe Asp Gly Leu Asn Ser Leu Thr Tyr Lys Val Leu Asp Val Gin Arg 

370 375 380 

Tyr Pro Leu Tyr Thr Gin He Thr Val Asp He Gly Thr Pro Arg 
30385 390 395 



<210> 5 
<211> 402 
<212> PRT 
35<213> Bos taurus 



<400> 5 

Met Lys Phe Arg Glu Pro Leu Leu 
1 5 
40Ala Ser Leu Gin Arg Ala Cys Arg 
20 



Gly Gly Ser Ala Ala Met Pro Gly 

10 15 
Leu Leu Val Ala Val Cys Ala Leu 
25 30 
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His Leu Gly Val Thr Leu Val Tyr Tyr Leu Ala Gly Arg Asp Leu Arg 

35 40 45 

Arg Leu Pro Gin Leu Val Gly Val His Pro Pro Leu Gin Gly Ser Ser 
50 55 60 

5His Gly Ala Ala Ala He Gly Gin Pro Ser Gly Glu Leu Arg Leu Arg 
65 70 75 80 

Gly Val Ala Pro Pro Pro Pro Leu Gin Asn Ser Ser Lys Pro Arg Ser 

85 90 95 

Arg Ala Pro Ser Asn Leu Asp Ala Tyr Ser His Pro Gly Pro Gly Pro 
10 100 105 no 

Gly Pro Gly Ser Asn Leu Thr Ser Ala Pro Val Pro Ser Thr Thr Thr 

115 120 125 

Arg Ser Leu Thr Ala Cys Pro Glu Glu Ser Pro Leu Leu Val Gly Pro 
130 135 140 

15Met Leu He Glu Phe Asn He Pro Val Asp Leu Lys Leu He Glu Gin 
145 150 155 iso 

Gin Asn Pro Lys Val Lys Leu Gly Gly Arg Tyr Thr Pro Met Asp Cys 

165 170 175 

He Ser Pro His Lys Val Ala He He He Leu Phe Arg Asn Arg Gin 
20 180 i 8 5 190 

Glu His Leu Lys Tyr Trp Leu Tyr Tyr Leu His Pro Met Val Gin Arg 

I 95 200 205 

Gin Gin Leu Asp Tyr Gly He Tyr Val He Asn Gin Ala Gly Glu Ser 
2X0 215 220 

25Met Phe Asn Arg Ala Lys Leu Leu Asn Val Gly Phe Lys Glu Ala Leu 
225 2 30 235 * 24 o 

Lys Asp Tyr Asp Tyr Asn Cys Phe Val Phe Ser Asp Val Asp Leu He 

245 250 255 

Pro Met Asn Asp His Asn Thr Tyr Arg Cys Phe Ser Gin Pro Arg His 
30 260 265 2 70 

He Ser Val Ala Met Asp Lys Phe Gly Phe Ser Leu Pro Tyr Val Gin 

275 280 285 

Tyr Phe Gly Gly Val Ser Ala Leu Ser Lys Gin Gin Phe Leu Ser He 
290 295 300 

35Asn Gly Phe Pro Asn Asn Tyr Trp Gly Trp Gly Gly Glu Asp Asp Asp 
305 310 315 320 

He Tyr Asn Arg Leu Ala Phe Arg Gly Met Ser Val Ser Arg Pro Asn 

325 330 335 

Ala Val He Gly Lys Cys Arg Met He Arg His Ser Arg Asp Lys Lys 
40 340- 345 350 

Asn Glu Pro Asn Pro Gin Arg Phe Asp Arg He Ala His Thr Lys Glu 
355 360 365 
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Thr Met Leu Ser Asp Gly Leu Asn Ser Leu Thr Tyr Met Val Leu Glu 

370 375 380 

Val Gin Arg Tyr Pro Leu Tyr Thr Lys He Thr Val Asp He Gly Thr 
3Q 5 390 395 400 

5Pro Ser 



<210> 6 

<211> 113 

10<212> PRT 

<213> Homo sapiens 



<400> 6 

Arg Asp Leu Ser Arg Leu Pro Gin Leu Val Gly Val Ser Thr Pro Leu 
15 1 5 io 15 

Gin Gly Gly Ser Asn Ser Ala Ala Ala He Gly Gin Ser Ser Gly Asp 

20 25 30 

Leu Arg Thr Gly Gly Ala Arg Pro Pro Pro Pro Leu Gly Ala Ser Ser 
35 40 45 

2 0Gln Pro Arg Pro Gly Gly Asp Ser Ser Pro Val Val Asp Ser Gly Pro 
50 55 60 

Gly Pro Ala Ser Asn Leu Thr Ser Val Pro Val Pro His Thr Thr Ala 
65 70 75 80 

Leu Ser Leu Pro Ala Cys Pro Glu Glu Ser Pro Leu Leu Val Gly Pro 
25 85 90 95 

Met Leu He Glu Phe Asn Met Pro Val Asp Leu Glu Leu Val Ala Lys 
100 105 no 

Gin 



30 

<210> 7 

<211> 85 

<212> PRT 

<213> Bos taurus 

35 

<400> 7 

i 

Arg Asp Leu Arg Arg Leu Pro Gin Leu Val Gly Val His Pro Pro Leu 

1 5 io is 

Gin Gly Ser Ser His Gly Ala Ala Ala He Gly Gin Pro Ser Gly Glu 
40 20 25 30 

Leu Arg Leu Arg Gly Val Ala Pro Pro Pro Pro Leu Gin Asn Ser Ser 
35 40 45 
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Lys Pro Arg Ser Arg Ala Pro Ser Asn Leu Asp Ala Tyr Ser His Pro 

50 55 60 

Gly Pro Gly Pro Gly Pro Gly Ser Asn Leu Thr Ser Ala Pro Val Pro 
65 70 75 80 

5Ser Thr Thr Thr Arg 
85 

<210> 8 
<211> 273 
10<212> PRT 

<213> Homo sapiens 

<400> 8 

Ser Leu Pro Ala Cys Pro Glu Glu Ser Pro Leu Leu Val Gly Pro Met 
15 1 5 io 15 

Leu He Glu Phe Asn Met Pro Val Asp Leu Glu Leu Val Ala Lys Gin 

20 25 30 

Asn Pro Asn Val Lys Met Gly Gly Arg Tyr Ala Pro Arg Asp Cys Val 
35 40 45 

20Ser Pro His Lys Val Ala He He He Pro Phe Arg Asn Arg Gin Glu 
50 55 60 

His Leu Lys Tyr Trp Leu Tyr Tyr Leu His Pro Val Leu Gin Arg Gin 
65 70 75 80 

Gin Leu Asp Tyr Gly He Tyr Val He Asn Gin Ala Gly Asp Thr He 
25 85 90 95 

Phe Asn Arg Ala Lys Leu Leu Asn Val Gly Phe Gin Glu Ala Leu Lys 

100 105 no 

Asp Tyr Asp Tyr Thr Cys Phe Val Phe Ser Asp Val Asp Leu He Pro 
H5 120 125 

30Met Asn Asp His Asn Ala Tyr Arg Cys Phe Ser Gin Pro Arg His He 
130 135 140 

Ser Val Ala Met Asp Lys Phe Gly Phe Ser Leu Pro Tyr Val Gin Tyr 
145 i 5 o i5 S 16Q 

Phe Gly Gly Val Ser Ala Ser Ser Lys Gin Gin Phe Leu Thr He Asn 
35 165 170 175 

Gly Phe Pro Asn Asn Tyr Trp Gly Trp Gly Gly Glu Asp Asp Asp He 

180 185 190 

Phe Asn Arg Leu Val Phe Arg Gly Met Ser He Ser Arg Pro Asn Ala 
195 200 205 

40Val Val Gly Thr Cys Arg Met He Arg His Ser Arg Asp Lys Lys Asn 
210 215 220 
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Glu Pro Asn Pro Gin Arg Phe Asp 
225 230 
Met Leu Ser Asp Gly Leu Asn Ser 
245 

5Gln Arg Tyr Pro Leu Tyr Thr Gin 
260 

Ser 



8 

Arg He Ala His Thr Lys Glu Thr 
235 240 
Leu Thr Tyr Gin Val Leu Asp Val 

250 255 
He Thr Val Asp He Gly Thr Pro 
265 270 



10<210> 9 
<211> 273 
<212> PRT 
<213> Bos taurus 

15<400> 9 

Ser Leu Thr Ala Cys Pro Glu Glu Ser Pro Leu Leu Val Gly Pro Met 

1 5 io 15 

Leu He Glu Phe Asn He Pro Val Asp Leu Lys Leu He Glu Gin Gin 
20 25 30 

2 0Asn Pro Lys Val Lys Leu Gly Gly Arg Tyr Thr Pro Met Asp Cys He 
35 40 45 

Ser Pro His Lys Val Ala He He He Leu Phe Arg Asn Arg Gin Glu 

50 55 60 

His Leu Lys Tyr Trp Leu Tyr Tyr Leu His Pro Met Val Gin Arg Gin 
2565 70 75 80 

Gin Leu Asp Tyr Gly He Tyr Val He Asn Gin Ala Gly Glu Ser Met 

85 90 95 

Phe Asn Arg Ala Lys Leu Leu Asn Val Gly Phe Lys Glu Ala Leu Lys 
100 105 no 

30 Asp Tyr Asp Tyr Asn Cys Phe Val Phe Ser Asp Val Asp Leu He Pro 
115 120 125 

Met Asn Asp His Asn Thr Tyr Arg Cys Phe Ser Gin Pro Arg His He 

130 135 140 

Ser Val Ala Met Asp Lys Phe Gly Phe Ser Leu Pro Tyr Val Gin Tyr 
35145 150 155 i 60 

Phe Gly Gly Val Ser Ala Leu Ser Lys Gin Gin Phe Leu Ser He Asn 

165 170 i 7 5 

Gly Phe Pro Asn Asn Tyr Trp Gly Trp Gly Gly Glu Asp Asp Asp He 
180 185 190 

40Tyr Asn Arg Leu Ala Phe Arg Gly Met Ser Val Ser Arg Pro Asn Ala 
195 200 205 



WO 2004/063344 



PCT/US2004/000470 



Val lie Gly Lys Cys Arg Met lie Arg His Ser Arg Asp Lys Lys Asn 

210 215 220 

Glu Pro Asn Pro Gin Arg Phe Asp Arg He Ala His Thr Lys Glu Thr 
225 230 235 2 40 

5Met Leu Ser Asp Gly Leu Asn Ser Leu Thr Tyr Met Val Leu Glu Val 
245 250 255 

Gin Arg Tyr Pro Leu Tyr Thr Lys He Thr Val Asp He Gly Thr Pro 
260 265 270 

Ser 



10 



<210> 10 
<2H> 1197 
<212> PRT 
15<213> Homo sapiens 



<400> 10 

Ala Thr Gly Ala Gly Gly Cys Thr Thr Cys Gly Gly Gly Ala Gly Cys 
1 5 10 15 

20Cys Gly Cys Thr Cys Cys Thr Gly Ala Gly Cys Cys Gly Gly Ala Gly 
20 25 30 • 

Cys Gly Cys Cys Gly Cys Gly Ala Thr Gly Cys Cys Ala Gly Gly Cys 

35 40 45 

Gly Cys Gly Thr Cys Cys Cys Thr Ala Cys Ala Gly Cys Gly Gly Gly 
25 50 55 60 

Cys Cys Thr Gly Cys Cys Gly Cys Cys Thr Gly Cys Thr Cys Gly Thr 



65 

80 



70 75 



Gly Gly Cys Cys Gly Thr Cys Thr Gly Cys Gly Cys Thr Cys Thr Gly 
85 90 95 

30Cys Ala Cys Cys Thr Thr Gly Gly Cys Gly Thr Cys Ala Cys Cys Cys 



100 105 110 

Thr Cys Gly Thr Thr Thr Ala Cys Thr Ala Cys Cys Thr Gly Gly Cys 

115 120 125 

Thr Gly Gly Cys Cys Gly Cys Gly Ala Cys Cys Thr Gly Ala Gly Cys 
35 130 135 140 

Cys Gly Cys Cys Thr Gly Cys Cys Cys Cys Ala Ala Cys Thr Gly Gly 
145 150 155 160 

Thr Cys Gly Gly Ala Gly Thr Cys Thr Cys Cys Ala Cys Ala Cys Cys 
165 170 175 

40Gly cys Thr Gly Cys Ala Gly Gly Gly Cys Gly Gly Gly Thr Cys Gly 



180 185 190 
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Ala Ala Cys Ala Gly 
195 

Thr Cys Gly Gly Gly 
210 

5Gly Gly Ala Cys Cys 
225 

Gly Gly Gly Gly Cys 
245 

Cys Thr Cys Cys Thr 
10 260 
Cys Thr Cys Cys Cys 
275 

Gly Gly Thr Gly Gly 
290 

15 Cys Ala Gly Thr Cys 
305 

Cys Cys Cys Thr Gly 
325 

Ala Ala Cys Thr Thr 
20 340 
Cys Ala Gly Thr Gly 
355 

Cys Gly Cys Ala Cys 
370 

25Gly Cys Cys Thr Gly 
385 

Cys Cys Cys Cys Gly 
405 

Cys Cys Cys Cys Ala 
30 420 
Thr Thr Thr Ala Ala 
435 

Ala Cys Cys Thr Gly 
450 

35Ala Ala Ala Gly Cys 
465 

Gly Thr Gly Ala Ala 
485 

Gly Cys Thr Ala Thr 
40 500 
Cys Thr Gly Cys Gly 
515 
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Thr Gly Cys Cys Gly Cys 
200 

Cys Ala Gly Thr Cys Cys 
215 

Thr Cys Cys Gly Gly Ala 
230 235 
Cys Cys Gly Gly Cys Cys 
250 

Cys Thr Ala Gly Gly Cys 
265 

Ala Gly Cys Cys Gly Cys 
280 

Cys Gly Ala Cys Thr Cys 
295 

Gly Thr Gly Gly Ala Thr 
310 315 
Gly Cys Cys Cys Cys Gly 
330 

Gly Ala Cys Cys Thr Cys 
345 

Cys Cys Cys Cys Ala Cys 
360 

Thr Gly Thr Cys Gly Cys 
375 

Cys Cys Cys Thr Gly Ala 
390 395 
Cys Thr Gly Cys Thr Thr 
410 

Thr Gly Cys Thr Gly Ala 
425 

Cys Ala Thr Gly Cys Cys 
440 

Gly Ala Gly Cys Thr Cys 
455 

Ala Gly Ala Ala Cys Cys 
470 475 
Gly Ala Thr Gly Gly Gly 
490 

Gly Cys Cys Cys Cys Cys 
505 

Thr Cys Thr Cys Thr Cys 
520 



Cys Gly Cys Cys Ala 
205 

Thr Cys Cys Gly Gly 
220 

Cys Cys Gly Gly Ala 
240 

Gly Cys Cys Gly Cys 
255 

Gly Cys Cys Thr Cys 
270 

Gly Cys Cys Cys Gly 
285 

Cys Ala Gly Cys Cys 
300 

Thr Cys Thr Gly Gly 
320 

Cys Thr Ala Gly Cys 
335 

Gly Gly Thr Cys Cys 
350 

Ala Cys Cys Ala Cys 
365 

Thr Gly Cys Cys Cys 
380 

Gly Gly Ala Gly Thr 
400 

Gly Thr Gly Gly Gly 
415 

Thr Thr Gly Ala Gly 
430 

Thr Gly Thr Gly Gly 
445 

Gly Thr Gly Gly Cys 
460 

Cys Ala Ala Ala Thr 
480 

Cys Gly Gly Cys Cys 
495 

Ala Gly Gly Gly Ala 
510 

Cys Thr Cys Ala Cys 
525 
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Ala Ala Gly Gly Thr Gly Gly Cys Cys Ala Thr Cys Ala Thr Cys Ala 

530 535 540 

Thr Thr Cys Cys Ala Thr Thr Cys Cys Gly Cys Ala Ala Cys Cys Gly 
545 550 555 560 

SGly Cys Ala Gly Gly Ala Gly Cys Ala Cys Cys Thr Cys Ala Ala Gly 
565 570 575 

Thr Ala Cys Thr Gly Gly Cys Thr Ala Thr Ala Thr Thr Ala. Thr Thr 

580 585 590 

Thr Gly Cys Ala Cys Cys Cys Ala Gly Thr Cys Cys Thr Gly Cys Ala 
10 595 600 605 

Gly Cys Gly Cys Cys Ala Gly Cys Ala Gly Cys Thr Gly Gly Ala Cys 

610 615 620 

Thr Ala Thr Gly Gly Cys Ala Thr Cys Thr Ala Thr Gly Thr Thr Ala 
625 630 635 640 

15Thr Cys Ala Ala Cys Cys Ala Gly Gly Cys Gly Gly Gly Ala Gly Ala 

645 650 655 

Cys Ala Cys Thr Ala Thr Ala Thr Thr Cys Ala Ala Thr Cys Gly Thr 

660 665 670 

Gly Cys Thr Ala Ala Gly Cys Thr Cys Cys Thr Cys Ala Ala Thr Gly 
20 675 680 685 

Thr Thr Gly Gly Cys Thr Thr Thr Cys Ala Ala Gly Ala Ala Gly Cys 

690 695 700 

Cys Thr Thr Gly Ala Ala Gly Gly Ala Cys Thr Ala Thr Gly Ala Cys 
705 710 715 720 

25Thr Ala Cys Ala Cys Cys Thr Gly Cys Thr Thr Thr Gly Thr Gly Thr 

725 730 735 

Thr Thr Ala Gly Thr Gly Ala Cys Gly Thr Gly Gly Ala Cys Cys Thr 

740 745 750 

Cys Ala Thr Thr Cys Cys Ala Ala Thr Gly Ala Ala Thr Gly Ala Thr 
30 755 760 765 

Cys Ala Thr Ala Ala Thr Gly Cys Gly Thr Ala Cys Ala Gly Gly Thr 

770 775 780 

Gly Thr Thr Thr Thr Thr Cys Ala Cys Ala Gly Cys Cys Ala Cys Gly 
785 790 795 800 

3 SGly Cys Ala Cys Ala Thr Thr Thr Cys Cys Gly Thr Thr Gly Cys Ala 

805 810 815 

Ala Thr Gly Gly Ala Thr Ala Ala Gly Thr Thr Thr Gly Gly Ala Thr 

820 825 830 

Thr Cys Ala Gly Cys Cys Thr Ala Cys Cys Thr Thr Ala Thr Gly Thr 
40 835 840 845 

Thr Cys Ala Gly Thr Ala Thr Thr Thr Thr Gly Gly Ala Gly Gly Thr 
850 855 Q6Q 
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Gly Thr Cys Thr Cys Thr Gly Cys Thr Thr Cys Ala Ala Gly Thr Ala 
865 870 875 880 

Ala Ala Cys Ala Ala Cys Ala Gly Thr Thr Thr Cys Thr Ala Ala Cys 
885 890 895 

5Cys Ala Thr Cys Ala Ala Thr Gly Gly Ala Thr Thr Thr Cys Cys Thr 
900 905 910 

Ala Ala Thr Ala. Ala Thr Thr Ala Thr Thr Gly Gly Gly Gly Cys Thr 

915 920 925 

Gly Gly Gly Gly Ala Gly Gly Ala Gly Ala Ala Gly Ala Thr Gly Ala 
10 930 935 , 940 

Thr Gly Ala Cys Ala Thr Thr Thr Thr Thr Ala Ala Cys Ala Gly Ala 
945 950 955 960 

Thr Thr Ala Gly Thr Thr Thr Thr Thr Ala Gly Ala Gly Gly Cys Ala 
965 970 975 

15Thr Gly Thr Cys Thr Ala Thr Ala Thr Cys Thr Cys Gly Cys Cys Cys 
980 985 990 

Ala Ala Ala Thr Gly Cys Thr Gly Thr Gly Gly Thr Cys Gly Gly Gly 

995 iooo 1005 

Ala Cys Gly Thr Gly Thr Cys Gly Cys Ala Thr Gly Ala Thr Cys Cys 
20 1010 1015 1020 

Gly Cys Cys Ala Cys Thr Cys Ala Ala Gly Ala Gly Ala Cys Ala Ala 
1025 1030 1035 1040 

Gly Ala Ala Ala Ala Ala Thr Gly Ala Ala Cys Cys Cys Ala Ala Thr 
1045 1050 1055 

25Cys Cys Thr Cys Ala Gly Ala Gly Gly Thr Thr Thr Gly Ala Cys Cys 
1° 60 1065 1070 

Gly Ala Ala Thr Thr Gly Cys Ala Cys Ala Cys Ala Cys Ala Ala Ala 

1075 1080 1085 

Gly Gly Ala Gly Ala Cys Ala Ala Thr Gly Cys Thr Cys Thr Cys Thr 
30 1090 1095 iioo 

Gly Ala Thr Gly Gly Thr Thr Thr Gly Ala Ala Cys Thr Cys Ala Cys 
1105 mo 1115 1120 

Thr Cys Ala Cys Cys Thr Ala Cys Cys Ala Gly Gly Thr Gly Cys Thr 
H25 H30 1135 

35Gly Gly Ala Thr Gly Thr Ala Cys Ala Gly Ala Gly Ala Thr Ala Cys 
114° 1145 H50 

Cys Cys Ala Thr Thr Gly Thr Ala Thr Ala Cys Cys Cys Ala Ala Ala 

II 55 1160 H65 

Thr Cys Ala Cys Ala Gly Thr Gly Gly Ala Cys Ala Thr Cys Gly Gly 
40 ll 70 1175 H80 

Gly Ala Cys Ala Cys Cys Gly Ala Gly Cys Thr Ala Gly 
ll 85 1190 1195 
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<210> 11 
<211> 42 
<212> DNA 
5<213> Artificial Sequence 

<220> 

<223> A synthetic primer 
10<400> 11 

ccttacgtgc aattgtttgg aggtgtctct gctctaagta aa 

<210> 12 
<211> 36 
15<212> DNA 

<213> Artificial Sequence 

<220> 

<223> A synthetic primer 

20 

<400> 12 

gacacctcca aacaattgca cgtaaggtag gctaaa 

<210> 13 
25<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

3 0<223> A synthetic primer 
<400> 13 

ctaccttacg tgcagatctt tggaggtgtc tctgctctaa g 

35<210> 14 
<211> 40 
<212> DNA 

<213> Artificial Sequence 

40<220> 

<223> A synthetic primer 
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14 

<400> 14 

gacacctcca aagatctgca cgtaaggtag gctaatccaa 

<210> 15 
5<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

10<223> A synthetic primer 
<400> 15 

ggattagcct accatatgtg cagaattttg gaggtgtctc t 

15<210> 16 
<211> 42 
<212> DNA 

<213> Artificial Sequence 

20<220> 

<223> A synthetic primer 

<400> 16 

agagacacct ccaaaattct gcacatctgg taggctaaat cc 

25 

<210> 17 
<211> 39 
<212> DNA 

<213> Artificial Sequence 

30 

<220> 

<223> A synthetic primer 
<400> 17 

35cgcggatccc gcgacctaag acgcctgcct cagctggtc 

<210> 18 
<211> 38 
<212> DNA 
40<213> Artificial Sequence 
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15 



<220> 



<223> A synthetic primer 



<400> 18 



Stggaattcct 



agctcggcgt cccgatgtcc actgtgat 



38 



<210> 19 

<211> 39 

<212> DNA 

10<213> Homo sapiens 

<400> 19 

cgcggatccc gcgacctgag ccgcctgccc caactggtc 

15<210> 20 
<211> 39 
<212> DNA 

<213> Homo sapiens 
20<400> 20 

ccggaattcc tactagctcg gtgtcccgat gtccactgt 

<210> 21 
<211> 8 
25<212> PRT 

<213> Bos taurus 

<400> 21 

Ser Arg Ala Pro Ser Asn Leu Asp 
30 1 5 

<210> 22 
<211> 11 
<212> PRT 
35<213> Bos taurus 

<400> 22 

Lys Pro Arg Ser Arg Ala Pro Ser Asn Leu Asp 



1 



5 



10 



40 
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16 

<210> 23 
<211> 10 
<212> PRT 

<213> Homo sapiens 

5 

<400> 23 

Gin Leu Val Gly Val Ser Thr Pro Leu Gin 
15 10 
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T" later document published after the international Filing date or priority 

date and not in conflict with the application but cited to understand the 
principle or theory underlying the invention 

W X" document of particular relevance; the claimed invention cannot be 

considered novel or cannot be considered to involve an inventive step 
when the document is taken alone 

U Y" document of particular relevance; the claimed invention cannot be 

considered to involve an inventive step when the document is 
combined with one or more other such documents, such combination 
being obvious to a person skilled in the art 

document member of the same patent family 
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Box No. II Observations where certain claims were found unsearchable (Continuation of item 2 of first sheet) 

This international search report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 

1. 1 | Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



| | QaimsNos.: 

because they relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 



| | Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box No. IH Observations where unity of invention is lacking (Continuation of item 3 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Continuation Sheet 



1 . ( | As all required additional search fees were timely paid by the applicant, this international search report covers all 

searchable claims. 

2 - [Zi As 311 searchable claims could be searched without effort justifying an additional fee, this Authority did not invite 
payment of any additional fee. 

3. 1 1 As only some of the required additional search fees were timely paid by the applicant, this international search report 
covers only those claims for which fees were paid, specifically claims Nos. : 



4. I/nJ No required additional search fees were timely paid by the applicant. Consequently, this international search report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 1-4 and 43-57 

Remark on Protest {^\ The additional search fees were accompanied by the applicant's protest. 

I | No protest accompanied the payment of additional search fees. 
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BOX m. OBSERVATIONS WHERE UNITY OF INVENTION IS LACKING 

This application contains the following inventions or groups of inventions which are not so linked as to form a single general inventive 
concept under PCT Rule 13.1. In order for all inventions to be examined, the appropriate additional examination fees must be paid. 

Group 1, claims 1-4, 43-57 drawn to a purified catalytic domain of a 6 1,4-galactosyltransferase that catalyzes the formation of glucose- 
s' t 4>N-acetylglucosamine bond at a greater than wild type enzyme. 

Group 2, claims 5-7, 43-57 drawn to a purified catalytic domain of a 8 1 ,4-galactosyltransferase that catalyzes the formation of N- 
acetylgalactosamine-B(l ,4)-N-acetylglucosamine bond. 

Group 3, claims 8-10, 43-57 drawn to a purified catalytic domain of a B 1,4-galactosyltransferase that catalyzes the formation of N- 
acetylgalactosamine -8(1,4)- glucose bond in the presence of alpha-lactalbumin. 

Group 4, claims 11-13, 43-57 drawn to a purified catalytic domain of a B 1,4-galactosyltransferase that catalyzes the formation of N- 
acetylglucosamine -8(1,4)- N-acetylglucosamine bond. 

Group 5, claims 14-16, 43-57 drawn to a purified catalytic domain of a 6 1,4-galactosyltransferase that catalyzes the formation of 
mannose-6(l,4)- N-acetylglucosamine bond. 

Group 6, claims 17-19, drawn to a purified catalytic domain of a 8 1,4-galactosyltransferase that catalyzes the formation of galactose- 
6(1,4)- N-acetylglucosamine-6-S03 bond. 

Group 7, claims 20-21, drawn to a polypeptide that promotes in vitro folding of a catalytic domain of a galactosyltransferase. 

Group 8, claims 22-24, drawn to a nucleic acid segment that encodes a catalytic domain of a galactosyltransferase. 

Group 9, claims 25-26, drawn to a method to synthesize a glucose-B(l,4)N-acetylgluc»samine moiety. 

Group 10, claim 27, drawn to an oligosaccharide comprising a glucose-B(l,4)N-acetylglucosamine moiety. 

Group 11, claims 28-29, drawn to a method to synthesize a N-acetylgalactosamine-6(l,4)N-acetylglucosarj^ moiety. 

Grop 12, claim 30, drawn to an oligosaccharide comprising a N-acetylgalactosainine-8(l,4)N-acetylglucosarmne moiety. 
Group 13, claims 31-32, drawn to a method to synthesize a N-acetylgalactosamine-8(l,4)N-glucose moiety. 

Grop 14, claim 33, drawn to an oligosaccharide comprising a N-acetylgalactosamine-fi(l,4)N-glucose moiety. 

Group 15, claims 34-35, drawn to a method to synthesize a N- acetylglucosamine -B(1,4)N- acetylgmcosamine moiety. 

Group 16, claim 30, drawn to an oligosaccharide comprising a N- acetylglucosamine -fi(l,4)N- acetylglucosamine moiety. 
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Group 17, claims 34-35, drawn to a method to synthesize a N- acetylglucosamine -B(1,4)N- acetylglucosamine moiety. 



.teroup 18, claim 36, drawn to an oligosaccharide comprising a N- acetylglucosamine -fl(l,4)N- acetylglucosamine moiety. 
Group 19, claims 37-38, drawn to a method to synthesize a mannose-fl(l,4)N- acetylglucosamine moiety. 
Group 20, claim 39, drawn to an oligosaccharide comprising a mannose-6(l,4)N- acetylglucosamine moiety. 
Group 21, claims 40-41, drawn to a method to synthesize a galactose-B(l,4)N- acetylglucosamine-6-S03 moiety. 
Group 22, claim 42, drawn to an oligosaccharide comprising a mannose-0(l,4)N- acetylglucosamine moiety. 
Group 23, claims 58-60, drawn to kit. 

The inventions listed as Groups 1-23 do not relate to a single general inventive concept under PCT Rule 13.1 because, under PCT Rule 
13.2, they lack the same or corresponding special technical features for the following reasons: The ISA considers mat where multiple 
products and processes are claimed, the main invention shall consist of the first invention of the category first mentioned in the claims 
and the first recited invention of each of the other categories related thereto. Accordingly, the main invention (Group I) comprises the 
first-recited product, a polypeptide having a transferase activity. Furthermore the ISA considers that any feature which the subsequendy 
recited products and methods share with the main invention does not constitute a special technical feature within the meaning of PCT 
Rule 13.2 and that each of such products and methods accordingly defines a separate invention. 

The above application also lacks unity of invention as the invention claimed in the first claim of the first group does not 
contribute over the prior art. The invention of group I is already known in the prior art. (see Jenis et al., Increased activity of a B- 
galactosyltransferase Cancer Biochem. Biophys., 1982, Vol. 6(l):37-45. 
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